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Preface 



Science and engineering, our modern ways of understanding and altering 
the world, are said to be about accuracy and precision. Yet we best master 
the complexity of our world by cultivating insight rather than precision. 

We need insight because our minds are but a small part of the world. An 
insight unifies fragments of knowledge into a compact picture that fits in 
our minds. But precision can overflow our mental registers, washing away 
the understanding brought by insight. This book shows you how to build 
insight and understanding first, so that you do not drown in complexity. 

Less Therefore, our approach will not be rigorous — for rigor easily becomes rigor 
rigor mortis or paralysis by analysis. Forgoing rigor, we'll study the natural and 
human-created worlds — the worlds of science and engineering. So you'll 
need some — bu t not extensive! — knowledge of physics concepts such as 
force, power, energy, charge, and field. We'll use as little mathematics as 
possible — algebra and geometry mostly, trigonometry sometimes, and cal- 
culus rarely — so that the mathematics promotes rather than hinders insight, 
understanding, and flexible problem solving. The goal is to help you mas- 
ter complexity; then no problem can intimidate you. 

Like all important parts of our lives, whether spouses or careers, I came to 
this approach mostly unplanned. As a graduate student, I gave my first sci- 
entific talk on the chemical reactions in the retinal rod. I could make sense 
of the chemical chaos only by approximating. In that same year, my friend 
Carlos Brody wondered about the distribution of twin primes — prime pairs 
separated by 2, such as 3 and 5 or 11 and 13. Nobody knows the distribu- 
tion for sure. As a lazy physicist, I approximately answered Carlos's ques- 
tion with a probabilistic model of being prime [32]. Approximations, I saw 
again, foster understanding. 

As a physics graduate student, I needed to prepare for the graduate qualify- 
ing exams. I also became a teaching assistant for the "Order-of-Magnitude 
Physics" course. In three months, preparing for the qualifying exams and 
learning the course material to stay a day ahead of the students, I learned 
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more physics than I had in the years of my undergraduate degree. Physics 
teaching and learning had much room for improvement — and approxima- 
tion and insight could fill the gap. 

Dedi- In gratitude to my teachers, I dedicate this book to Carver Mead for irre- 
cation placeable guidance and faith; and to Peter Goldreich and Sterl Phinney, 
who developed the "Order-of-Magnitude Physics" course at Caltech. From 
them I learned the courage to simplify and gain insight — the courage that 
I look forward to teaching you. 

Organi- For many years, at the University of Cambridge and at MIT, I taught a 
zation course on the "Art of Approximation" organized by topics in physics and 
engineering. This organization limited the material's generality: Unless 
you become a specialist in general relativity, you may not study gravitation 
again. Yet estimating how much gravity deflects starlight (Section 5.3.1) 
teaches reasoning tools that you can use far beyond that example. Tools are 
more general and useful than topics. 

Therefore, I redesigned the course around the reasoning tools. This orga- 
nization, which I have used at MIT and Olin College of Engineering, is re- 
flected in this book — which teaches you one tool per chapter, each selected 
to help you build insight and master complexity. 

There are the two broad ways to master complexity: organize the complex- 
ity or discard it. Organizing complexity, the subject of Part I, is taught 
through two tools: divide-and-conquer reasoning (Chapter 1) and making 
abstractions (Chapter 2). 

Discarding complexity (Parts II and III) illustrates that "the art of being 
wise is the art of knowing what to overlook" (William James [24, p. 369]). 
In Part II, complexity is discarded without losing information. This part 
teaches three reasoning tools: symmetry and conservation (Chapter 3), pro- 
portional reasoning (Chapter 4), and dimensional analysis (Chapter 5). In 
Part III, complexity is discarded while losing information. This part teaches 
our final tools: lumping (Chapter 6), probabilistic reasoning (Chapter 7), 
easy cases (Chapter 8), and spring models (Chapter 9). 

Finding Using these tools, we will explore the natural and human-made worlds. We 
meaning will estimate the flight range of birds and planes, the strength of chemical 
bonds, and the angle that the Sun deflects starlight; understand the physics 
of pianos, xylophones, and speakers; and explain why skies are blue and 
sunsets are red. Our tools weave these and many other examples into a 
tapestry of meaning spanning science and engineering. 
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Sharing Like my earlier Street-Fighting Mathematics [33], this book is licensed under a 
this work Creative Commons Attribution-Noncommercial-Share Alike license. MIT 
Press and I hope that you will improve and share the work noncommer- 
cially, and we would gladly receive corrections and suggestions. 



Inter- The most effective teacher is a skilled tutor [2]. A tutor asks many questions, 
spersed because questioning, wondering, and discussing promote learning. Ques- 
ques- tions of two types are interspersed through the book. Questions marked with 
tions a in the margin, which a tutor would pose during a tutorial, ask you to de- 
velop the next steps of an argument. They are answered in the subsequent 
text, where you can check your thinking. Numbered problems, marked with 
a shaded background, which a tutor would give you to take home, ask you 
to practice the tool, to extend an example, to use several tools, and even to 
resolve an occasional paradox. Merely watching workout videos produces 
little fitness! So, try many questions of both types. 

Improve Through your effort, mastery will come — and with a broad benefit. As the 
our physicist Edwin Jaynes said of teaching [25]: 

world [T]he goal should be, not to implant in the students' mind every fact that the 
teacher knows now; but rather to implant a way of thinking that enables the 
student, in the future, to learn in one year what the teacher learned in two years. 
Only in that way can we continue to advance from one generation to the next. 

May the tools in this book help you advance our world beyond the state in 
which my generation has left it. 
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Part I 



Organizing complexity 



We cannot find much insight staring at a mess. We need to organize it. As 
an everyday example, when I look at my kitchen after a dinner party, I feel 
overwhelmed. It's late. I'm tired, and I dread that I will not get enough 
sleep. If I clean up in that scattered state of mind, I pick up a spoon here 
and a pot there, making little progress. However, when I remember that a 
large problem can be broken into smaller ones, calm and efficiency return. 
I begin at one corner of the kitchen, clear its mess, and move to neighboring 
areas until the project is done. I divide and conquer (Chapter 1). 

Once the dishes are clean, I resist the temptation to dump them into one 
big box. I separate pots from the silverware and, within the silverware, the 
forks from the spoons. These groupings, or abstractions (Chapter 2), make 
the kitchen easy to understand and use. 

In problem solving, we organize complexity by using divide-and-conquer 
reasoning and by making abstractions. In Part I, you'll learn how. 
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As imperial rulers knew, you need not conquer all your enemies at once. 
Instead, conquer them one at a time. Break hard problems into manageable 
pieces. This process embodies our first reasoning tool: Divide and conquer! 



1.1 Warming up 

To show how to use divide-and-conquer reasoning, we'll apply it to increas- 
ingly complex problems that illustrate its essential features. So we start 
with an everyday estimate. 

What is, roughly, the volume of a dollar bill? 

Volumes are hard to estimate. However, we should still make a quick guess. 
Even an inaccurate guess will help us practice courage and, when we com- 
pare the guess with a more accurate estimate, will help us calibrate our inter- 
nal measuring rods. To urge me on, I often imagine a mugger who holds a 
knife at my ribs, demanding, "Your guess or your life!" Then I judge it likely 
that the volume of a dollar bill lies between 0.1 and 10 cubic centimeters. 
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1 Divide and conquer 



This range is wide, spanning a factor of 100. In contrast, the dollar bill's 
width probably lies between 10 and 20 centimeters — a range of only a factor 
of 2. The volume range is wider than the width range because we have 
no equivalent of a ruler for volume; thus, volumes are less familiar than 
lengths. Fortunately, the volume of the dollar bill is the product of lengths. 

volume = width x height x thickness. (1.1) 

The harder volume estimate becomes three easier length 
estimates — the benefit of divide-and-conquer reasoning. 

The width looks like 6 inches, which is roughly 15 cen- 
timeters. The height looks like 2 or 3 inches, which is roughly 6 centimeters. 
But before estimating the thickness, let's talk about unit systems. 

Is it better to use metric or US customary units (such as inches, feet, and miles)? 

Your estimates will be more accurate if you use the units most familiar to 
you. Raised in the United States, I judge lengths more accurately in inches, 
feet, and miles than in centimeters, meters, or kilometers. However, for 
calculations requiring multiplication or division — most calculations — I con- 
vert the customary units to metric (and often convert back to customary 
units at the end). But you may be fortunate enough to think in metric. Then 
you can estimate and calculate in a single unit system. 

The third piece of the divide-and-conquer estimate, the thickness, is diffi- 
cult to judge. A dollar bill is thin — paper thin. 

But how thin is " paper thin"? 

This thickness is too small to grasp and judge easily. However, a stack of 
several hundred bills would be graspable. Not having that much cash lying 
around. I'll use paper. A ream of paper, which has 500 sheets, is roughly 
5 centimeters thick. Thus, one sheet of paper is roughly 0.01 centimeters 
thick. With this estimate for the thickness, the volume is approximately 
1 cubic centimeter: 

volume « 15 cm x 6 cm x 0.01cm « 1cm 3 . (1.2) 

width height thickness 

Although a more accurate calculation could adjust for the fiber composi- 
tion of a dollar bill compared to ordinary paper and might consider the 
roughness of the paper, these details obscure the main result: A dollar bill 
is 1 cubic centimeter pounded paper thin. 



6 cm 


$1 bill 




15 cm 
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To check this estimate, I folded a dollar bill until my finger strength gave 
out, getting a roughly cubical packet with sides of approximately 1 centime- 
ter — making a volume of approximately 1 cubic centimeter! 

In the preceding analysis, you may have noticed the = and « symbols and 
their slightly different use. Throughout this book, our goal is insight over 
accuracy. So we'll use several kinds of equality symbols to describe the 
accuracy of a relation and what it omits. Here is a table of the equality 
symbols, in descending order of completeness and often increasing order 
of usefulness. 



= equality by definition 


read as "is defined to be" 


= equality 


"is equal to" 


« equality except perhaps for a purely 
numerical factor near 1 


"is approximately equal to" 


~ equality except perhaps for a purely 


"is roughly equal to" or 


numerical factor 


"is comparable to" 


oc equality except perhaps for a factor 
that may have dimensions 


"is proportional to" 



As examples of the kinds of equality, for the circle below, A = nr 1 , and 
A « 4r 2 , and A ~ r 2 . For the cylinder, V ~ hr 2 — which implies V oc r 2 
and V oc h. In the V oc h form, the factor hidden in the oc symbol has 
dimensions of length squared. 




Problem 1.1 Weight of a box of books 

How heavy is a small moving-box filled with books? 

Problem 1.2 Mass of air in your bedroom 

Estimate the mass of air in your bedroom. 

Problem 1.3 Suitcase of bills 

In the movies, and perhaps in reality, cocaine and elections are bought with a suit- 
case of $100 bills. Estimate the dollar value in such a suitcase. 





6 



1 Divide and conquer 



Problem 1.4 Gold or bills? 

As a bank robber sitting in the vault planning your getaway, do you fill your suit- 
case with gold bars or $100 bills? Assume first that how much you can carry is a 
fixed weight. Then redo your analysis assuming that how much you can carry is 
a fixed volume. 



1.2 Rails versus roads 

We are now warmed up and ready to use divide-and-conquer reasoning for 
more substantial estimates. Our next estimate, concerning traffic, comes to 
mind whenever I drive the congested roads to JFK Airport in New York 
City. The route goes on the Van Wyck Expressway, which was planned by 
Robert Moses. As Moses's biographer Robert Caro describes [6, pp. 904ff], 
when Moses was in charge of building the expressway, the traffic planners 
recommended that, in order to handle the expected large volume of traffic, 
the road include a train line to the then-new airport. Alternatively, if build- 
ing the train track would be too expensive, they recommended that the city, 
when acquiring the land for the road, still take an extra 50 feet of width and 
reserve it as a median strip for a train line one day. Moses also rejected the 
cheaper proposal. Alas, only weeks after its opening, not long after World 
War Two, the rail-free highway had reached peak capacity. 

Let's use our divide-and-conquer tool to compare, for rush-hour commut- 
ing, the carrying capacities of rail and road. The capacity is the rate at which 
passengers are transported; it is passengers per time. First we'll estimate 
the capacity of one lane of highway. We can use the 2-second-following rule 
taught in many driving courses. You are taught to leave 2 seconds of travel 
time between you and the car in front. When drivers follow this rule, a sin- 
gle lane of highway carries one car every 2 seconds. To find the carrying 
capacity, we also need the occupancy of each car. Even at rush hour, at least 
in the United States, each car carries roughly one person. (Taxis often have 
two people including the driver, but only one person is being transported 
to the destination.) Thus, the capacity is one person every 2 seconds. As an 
hourly rate, the capacity is 1800 people per hour: 

1 person 3600 g! 1800 people 

X = . (1.3) 

2 g 1 hr hr 

The diagonal strike-through lines help us to spot which units cancel and to 
check that we end up with just the units that we want (people per hour). 
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This rate, 1800 people per hour, is approximate, because the 2-second fol- 
lowing rule is not a law of nature. The average gap might be 4 seconds late 
at night, 1 second during the day, and may vary from day to day or from 
highway to highway. But a 2-second gap is a reasonable compromise esti- 
mate. Replacing the complex distribution of following times with one time 
is an application of lumping — the tool discussed in Chapter 6. Organizing 
complexity almost always reduces detail. If we studied all highways at all 
times of day, the data, were we so unfortunate as to obtain them, would 
bury any insight. 

How does the capacity of a single lane of highway compare with the capacity of a 
train line? 

For the other half of the comparison, we'll estimate the rush-hour capacity 
of a train line in an advanced train system, say the French or German system. 
As when we estimated the volume of a dollar bill (Section 1.1), we divide the 
estimate into manageable pieces: how often a train runs on the track, how 
many cars are in each train, and how many passengers are in each car. Here 
are my armchair estimates for these quantities, kept slightly conservative to 
avoid overestimating the train-line's capacity. A single train car, when full 
at rush hour, may carry 150 people. A rush-hour train may consist of 20 cars. 
And, on a busy train route, a train may run every 10 minutes or six times 
per hour. Therefore, the train line's capacity is 18 000 people per hour: 

150 people 20 .cars' 6 trains' 18 000 people 

x x = . (1.4) 

.car" .trairf hr hr 

This capacity is ten times the capacity of a single fast-flowing highway lane. 
And this estimate is probably on the low side; Robert Caro [6, p. 901] gives 
an estimate of 40 000 to 50 000 people per hour. Using our lower rate, one 
train track in each direction could replace two highways even if each high- 
way had five lanes in each direction. 

1.3 Tree representations 

Our estimates for the volume of a dollar bill (Section 1.1) and for the rail 
and highway capacities (Section 1.2) used the same method: dividing hard 
problems into smaller ones. However, the structure of the analysis is buried 
within the sentences, paragraphs, and pages. The sequential presentation 
hides the structure. Because the structure is hierarchical — big problems 
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1 Divide and conquer 



split, or branch, into smaller problems — its most compact representation is 
a tree. A tree representation shows us the analysis in one glance. 



Here is the tree representation for the capacity 
of a train line. Unlike the biological variety, our 
trees stand on their head. Their roots, the goals, 
sit at the top of the tree. Their leaves, the small 
problems into which we have subdivided the 
goal, sit at the bottom. The orientation matches 
the way that we divide and conquer, filling the 
page downward as we subdivide. 

In making this first tree, we haven't estimated 
the quantities themselves. We have only identi- 
fied the quantities. The question marks remind 
us of our next step: to include estimates for the 
three leaves. These estimates were 150 people 
per car, 20 cars per train, and 6 trains per hour 
(giving the tree in the margin). 



capacity 




?? people ?? cars ?? trains 

car train hour 



capacity 




150 people 20 cars 6 trains 

car train hour 



Then we multiplied the leaf values to propagate 
the estimates upward from the leaves toward 
the root. The result was 18 000 people per hour. 
The completed tree shows us the entire estimate 
in one glance. 



capacity 

18000 people/hour 




150 people 20 cars 

This train-capacity tree had the simplest possi- ~ train 

ble structure with only two layers (the root layer 
and, as the second layer, the three leaves). The 

next level of complexity is a three-layer tree, which will represent our esti- 
mate for the volume of a dollar bill. It started as a two-layer tree with three 
leaves. 



6 trains 
hour 



volume 




width height thickness 



Then it grew, because, unlike the width and height, the thickness was diffi- 
cult to estimate just by looking at a dollar bill. Therefore, we divided that 
leaf into two easier leaves. 
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The result is the tree in the margin. The thick- 
ness leaf, which is the thickness per sheet, has 
split into (1) the thickness per ream and (2) 
the number of sheets per ream. The boxed 

— Ion the line connecting the thickness to the 

number of sheets per ream is a new and use- 
ful notation. The —1 tells us the exponent to 
apply to that leaf value when we propagate it 
upward to the root. thickness ?? sheets 

Here is why I write the — 1 as a full-sized num- ream ream 

ber rather than a small superscript. Most of 

our estimates require multiplying several factors. The only question for 
each factor is, "With what exponent does this factor enter?" The number 

— 1 directly answers this "What exponent?" question. (To avoid cluttering 
the tree, we don't indicate the most-frequent exponent of 1.) 

This new subtree then represents the following equation for the thickness 
of one sheet: 

. , thickness ( ?? sheets 

thickness = x 

ream \ ream 

The —1 exponent allows, at the cost of a slight complication in the tree no- 
tation, the leaf to represent the number of sheets per ream rather than a 
less-familiar fraction, the number of reams per sheet. 

Now we include our estimates for the leaf values. The width is 15 centime- 
ters. The height is 6 centimeters. The thickness of a ream of paper is 5 cen- 
timeters. And a ream contains 500 sheets of paper. The result is the follow- 
ing tree. 





volume 




ream 



ream 
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1 Divide and conquer 



Now we propagate the values to the root. 

The two bottommost leaves combine to tell 
us that the thickness of one sheet is 1(T 2 
centimeters. This thickness completes the 
tree's second layer. In the second layer, the 
three nodes tell us that the volume of a dol- 
lar bill — the root — is 1 cubic centimeter. 

With practice, you can read in this final tree 
all the steps of the analysis. The three nodes 
in the second layer show how the difficult 
volume estimate was subdivided into three 
easier estimates. That the width and height 
remained leaves indicates that these two es- 
timates felt reliable enough. In contrast, the 
the thickness indicate that the thickness was still hard to estimate, so we 
divided that estimate into two more-familiar quantities. 

The tree encapsulates many paragraphs of analysis in a compact form, one 
that our minds can absorb in a single glance. Organizing complexity helps 
us build insight. 

Problem 1.5 Tree for the suitcase of bills 

Make a tree diagram for your estimate in Problem 1.3. Do it in three steps: (1) Draw 
the tree without any leaf estimates, (2) estimate the leaf values, and (3) propagate 
the leaf values upward to the root. 



volume 
1 cm 3 
+ 



width height thickness 
15 cm 6 cm 10~ 2 cm 



5 cm thickness 



ream 



-1 

\ 

500 sheets 



two branches sprouting from 



1.4 Demand-side estimates 

Our analysis of the carrying capacity of highways and railways (Section 1.2) 
is an example of a frequent application of estimation in the social world — es- 
timating the size of a market. The highway-railway comparison proceeded 
by estimating the transportation supply. In other problems, a more feasi- 
ble analysis is based on the complementary idea of estimating the demand. 
Here is an example. 

How much oil does the United States import (in barrels per year)? 

The volume rate is enormous and therefore hard to picture. Divide-and-con- 
quer reasoning will tame the complexity. Just keep subdividing until the 
quantities are no longer daunting. 
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Here, subdivide the demand — the consumption. We consume oil in so 
many ways; estimating the consumption in each pathway would take a long 
time without producing much insight. Instead, let's estimate the largest 
consumption — likely to be cars — then adjust for other uses and for overall 
consumption versus imports. 

•alkusagtr imports 

imports = xar-usage x x . (1.6) 

oar-usage all-usage 

Here is the corresponding tree. The first fac- 
tor, the most difficult of the three to estimate, 
will require us to sprout branches and make 
a subtree. The second and third factors might 
be possible to estimate without subdividing. 

Now we must decide how to continue. 

Should we keep subdividing until we've built the 
entire tree and only then estimate the leaves, or should we try to estimate these 
leaves and then subdivide what we cannot estimate? 

It depends on one's own psychology. I feel anxious in the uncharted wa- 
ters of a new estimate. Sprouting new branches before making any leaf esti- 
mates increases my anxiety. The tree might never stop sprouting branches 
and leaves, and I'll never estimate them all. Thus, I prefer to harvest my 
progress right away by estimating the leaves before sprouting new branches. 
You should experiment to learn your psychology. You are your best prob- 
lem-solving tool, and it is helpful to know your tools. 

Because of my psychology. I'll first estimate a leaf quantity: 
all usage 

— • (1-7) 

car usage 

But don't do this estimate directly. It is more intuitive — that is, easier for 
our gut — to estimate first the ratio of car usage to other (noncar) usage. The 
ability to make such comparisons between disjoint sets, at least for physi- 
cal objects, is hard wired in our brains and independent of the ability to 
count. Not least, it is not limited to humans. The female lions studied by 
Karen McComb and her colleagues [35] would judge the relative size of 
their troop and a group of lions intruding on their territory. The females 
would approach the intruders only when they outnumbered the intruders 
by a large-enough ratio, roughly a factor of 2. 



imports 



car usage 



all usage 
car usage 



imports 
all usage 
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Other uses for oil include noncar modes of transport (trucks, trains, and 
planes), heating and cooling, and hydrocarbon-rich products such as fer- 
tilizer, plastics, and pesticides. In judging the relative importance of other 
uses compared to car usage, two arguments compete: (1) Other uses are so 
many and so significant, so they are much more important than car usage; 
and (2) cars are so ubiquitous and such an inefficient mode of transport, so 
car usage is much larger than other uses. To my gut, both arguments feel 
comparably plausible. My gut is telling me that the two categories have 
comparable usages: 

other usage 

— « 1. (1.8) 

car usage 



Based on this estimate, all usage (the sum of car and other usage) is roughly 
double the car usage: 
all usage 

— « 2. (1.9) 

car usage 



This estimate is the first leaf. It implicitly assumes that the gasoline fraction 
in a barrel of oil is high enough to feed the cars. Fortunately, if this assump- 
tion were wrong, we would get warning. For if the fraction were too low, 
we would build our transportation infrastructure around other means of 
transport — such as trains powered by electricity generated by burning the 
nongasoline fraction in oil barrels. In this probably less-polluted world, we 
would estimate how much oil was used by trains. 



Returning to our actual world, let's estimate the second leaf: 
imports 
all usage 



( 1 . 10 ) 



This adjustment factor accounts for the fact that only a portion of the oil 
consumed is imported. 



What does your gut tell you for this fraction? 



Again, don't estimate this fraction directly. Instead, to make a comparison 
between disjoint sets, first compare (net) imports with domestic production. 
In estimating this ratio, two arguments compete. On the one hand, the 
US media report extensively on oil production in other countries, which 
suggests that oil imports are large. On the other hand, there is also extensive 
coverage of US production and frequent comparison with countries such as 
Japan that have almost no domestic oil. My resulting gut feeling is that the 
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categories are comparable and therefore that imports are roughly one-half 
of all usage: 

imports imports 1 



domestic production 



1 



so 



all usage 2 



(l.ll) 



This leaf, as well as the other adjustment factor, are dimensionless numbers. 
Such numbers, the main topic of Chapter 5, have special value. Our percep- 
tual system is skilled at estimating dimensionless ratios. Therefore, a leaf 
node that is a dimensionless ratio probably does not need to be subdivided. 

The tree now has three leaves. Having plausi- 
ble estimates for two of them should give us 
courage to subdivide the remaining leaf, the 
total car usage, into easier estimates. That leaf 
will sprout its own branches and become an 
internal node. 

2 0.5 

How should we subdivide the car usage? 



imports 




A reasonable subdivision is into the number of cars N cars and car usage 

the per-car usage. Both quantities are easier to estimate than 
the root. The number of cars is related to the US population — a 
familiar number if you live in the United States. The per-car 
usage is easier to estimate than is the total usage of all US N cats usage/car 
cars. Our gut can more accurately judge human-scale quan- 
tities, such as the per-car usage, than it can judge vast numbers like the 
total usage of all US cars. 

For the same reason, let's not estimate the number of cars 
directly Instead, subdivide this leaf into two leaves: 

1. the number of people, and 

2. the number of cars per person. 

The first leaf is familiar, at least to residents of the United 
States: N people « 3 x 10 8 . 

The second leaf, cars per person, is a human-sized quantity 
In the United States, car ownership is widespread. Many adults own more 
than one car, and a cynic would say that even babies seem to own cars. 
Therefore, a rough and simple estimate might be one car per person — far 
easier to picture than the total number of cars! Then N cars ~ 3 x 10 8 . 



N C ars 

3 x 10 8 




Npeople cars /person 

3 x 10 s 1 





14 



1 Divide and conquer 



The per-car usage can be subdivided into three 
easier factors (leaves). Here are my estimates. 

1. How many miles per car year? Used cars with 
10 000 miles per year are considered low use 
but are not rare. Thus, for a typical year of 
driving, let's take a slightly longer distance: 
say, 20 000 miles or 30 000 kilometers. 

2. How many miles per gallon? A typical car fuel efficiency is 30 miles per 
US gallon. In metric units, it is about 100 kilometers per 8 liters. 

3. How many gallons per barrel? You might have seen barrels of asphalt 
along the side of the highway during road construction. Following our 
free-association tradition of equating the thickness of a sheet of paper 
and of a dollar bill, perhaps barrels of oil are like barrels of asphalt. 

Their volume can be computed by divide-and-conquer 
reasoning. Just approximate the cylinder as a rectangu- 
lar prism, estimate its three dimensions, and multiply: 

volume ~ lm x 0.5 m x 0.5 m = 0.25 m 3 . (1.12) 

height width depth 

A cubic meter is 1000 liters or, using the conversion of 
4 US gallons per liter, roughly 250 gallons. Therefore, 

0.25 cubic meters is roughly 60 gallons. (The official vol- 
ume of a barrel of oil is not too different at 42 gallons.) 

Multiplying these estimates, and not forgetting the effect of the two —1 ex- 
ponents, we get approximately 10 barrels per car per year (also written as 
barrels per car year): 

2 x 10 4 .miles' 1 gallon' 1 barrel 10 barrels 

x x « . (1.13) 

car year 30 jniles' 60 .gallons' car year 

In doing this calculation, first evaluate the units. The gallons and miles 
cancel, leaving barrels per year. Then evaluate the numbers. The 30 x 60 in 
the denominator is roughly 2000. The 2 x 10 4 from the numerator divided 
by the 2000 from the denominator produces the 10. 

This estimate is a subtree in the tree representing total car usage. The car 
usage then becomes 3 billion barrels per year: 

3 x 10 8 cars x ^ barrels _ 3 x 10 9 barrels 




usage /car 




?? miles ?? miles ?? gallons 

car year gallon barrel 



jcaf year 



year 



(1.14) 
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This estimate is itself a subtree in the tree 
representing oil imports. Because the two 
adjustment factors contribute a factor of 
2 x 0.5, which is just 1, the oil imports are 
also 3 billion barrels per year. 

Here is the full tree, which includes the car usage 
subtree for the total car usage of oil: 3 x io 9 barrels 

year 

imports 
3 x 10 9 barrels 



imports 
3 x 10 9 barrels 
year 



all usage 
car usage 
2 



imports 
all usage 
0.5 



car usage 
3 x 10 9 barrels 
year 



year 



all usage imports 

car usage all usage 

2 0.5 



^people 

3 X 10 8 



Nears 
3x 10 8 



1 car 
person 



usage /car 
10 barrels 
car year 



20 000 miles 
car year 




30 miles 
gallon 



60 gallons 
barrel 




height width depth 250 gallons 

1 m 0.5 m 0.5 m m 3 



Problem 1.6 Using metric units 

As practice with metric units (if you grew up in a nonmetric land) or to make the 
results more familiar (if you grew up in a metric land), redo the calculation using 
the metric values for the volume of a barrel, the distance a car is driven per year, 
and the fuel consumption of a typical car. 

How close is our estimate to official values? 
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For the US oil imports, the US Department of Energy reports 9.163 million 
barrels per day (for 2010). When I first saw this value, my heart sank twice. 
The first shock was the 9 in the 9 million. I assumed that it was the number 
of billions, and wondered how the estimate of 3 billion barrels could be 
a factor of 3 too small. The second shock was the "million" — how could 
the estimate be more than a factor of 100 too large? Then the "per day" 
reassured me. As a yearly rate, 9.163 million barrels per day is 3.34 billion 
barrels per year — only 10 percent higher than our estimate. Divide and 
conquer triumphs! 

Problem 1.7 Fuel efficiency of a 747 

Based on the cost of a long-distance plane ticket, estimate the following quantities: 

(a) the fuel efficiency of a 747, in passenger miles per gallon or passenger kilome- 
ters per liter; and (b) the volume of its fuel tank. Check your estimates against the 
technical data for a 747. 



1.5 Multiple estimates for the same quantity 

After making an estimate, it is natural to wonder about how much confi- 
dence to place in it. Perhaps we made an embarrassingly large mistake. The 
best way to know is to estimate the same quantity using another method. 
As an everyday example, let's observe how we add a list of numbers. 

12 

15 



We often add the numbers first from top to bottom. For 12 + 15 + 18, we 
calculate, "12 plus 15 is 27; 27 plus 18 is 45." To check the result, we add the 
numbers in the reverse order, from bottom to top: "18 plus 15 is 33; 33 plus 
12 is 45." The two totals agree, so each is probably correct: The calculations 
are unlikely to contain an error of exactly the same amount. This kind of 
redundancy catches errors. 

In contrast, mindless redundancy offers little protection. If we check the 
calculation by adding the numbers from top to bottom again, we usually 
repeat any mistakes. Similarly, rereading written drafts usually means over- 
looking the same spelling, grammar, or logic faults. Instead, stuff the draft 
in a drawer for a week, then look at it; or ask a colleague or friend — in both 
cases, use fresh eyes. 
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Reliability, in short, comes from intelligent redundancy. 

This principle helps you make reliable estimates. First, use several meth- 
ods to estimate the same quantity. Second, make the methods as differ- 
ent from one another as possible — for example, by using unrelated back- 
ground knowledge. This approach to reliability is another example of di- 
vide-and-conquer reasoning: The hard problem of making a reliable esti- 
mate becomes several simpler subproblems, one per estimation method. 

You saw an example in Section 1.1, where we estimated the volume of a dol- 
lar bill. The first method used divide-and-conquer reasoning based on the 
width, height, and thickness of the bill. The check was a comparison with a 
folded-up dollar bill. Both methods agreed on a volume of approximately 
1 cubic centimeter — giving us confidence in the estimate. 

For another example of using multiple methods, return to the estimate of 
the volume of an oil barrel (Section 1.4). We used a roadside asphalt barrel 
as a proxy for an oil barrel and estimated the volume of the roadside bar- 
rel. The result, 60 gallons, seemed plausible, but maybe oil barrels have a 
completely different size. One way to catch that kind of error is to use a 
different method for estimating the volume. For example, we might start 
with the cost of a barrel of oil — about $100 in 2013 — and the cost of a gallon 
of gasoline — about $2.50 before taxes, or 1 /40th of the cost of a barrel. If 
the markup on gasoline is not significant, then a barrel is roughly 40 gal- 
lons. Even with a markup, we can still say that a barrel is at least 40 gallons. 
Because our two estimates, 60 gallons and > 40 gallons, roughly agree, our 
confidence in both increases. If they had contradicted each other, one or 
both would be wrong, and we would look for the mistaken assumption, for 
the incorrect arithmetic, or for a third method. 



1.6 Talking to your gut 



As you have seen in the preceding examples, divide-and-con- us population 
quer estimates require reasonable estimates for the leaf quan- 
tities. To decide what is reasonable, you have to talk to your 
gut — what you will learn in this section. Talking to your gut 
feels strange at first, especially because science and engineer- 
ing are considered cerebral subjects. Let's therefore discuss 
how to hold the conversation. The example will be an esti- 
mate of the US population based on its area and population density. The 




population 

density 
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divide-and-conquer tree has two leaves. (In Section 6.3.1, you'll see a qual- 
itatively different method, where the two leaves will be the number of US 
states and the population of a typical state.) 

The area is the width times the height, so the area leaf 
itself splits into two leaves. Estimating the width and 
height requires only a short dialogue with the gut, at 
least if you live in the United States. Its width is a 6-hour 
plane flight at 500 miles per hour, so about 3000 miles; 
and the height is, as a rough estimate, two-thirds of the 
width, or 2000 miles. Therefore, the area is 6 million 
square miles: 

3000 miles x 2000 miles = 6 x 10 6 miles 2 . ( 1 . 16 ) 

In metric units, it is about 16 million square kilometers. 

Estimating the population density requires talking to your gut. If you are 
like me you have little conscious knowledge of the population density. Your 
gut might know, but you cannot ask it directly. The gut is connected to the 
right brain, which doesn't have language. Although the right brain knows 
a lot about the world, it cannot answer with a value, only with a feeling. 
To draw on its knowledge, ask it indirectly. Pick a particular population 
density — say, 100 people per square mile — and ask the gut for its opinion: 
"O, my intuitive, insightful, introverted right brain: What do you think of 
100 people per square mile for the population density?" A response, a gut 
feeling, will come back. Keep lowering the candidate value until the gut 
feeling becomes, "No, that value feels way too low." 

Here is the dialogue between my left brain (LB) and right brain (RB). 

LB: What do you think of 100 people per square mile? 

RB: That feels okay based on my experience growing up in the United States. 

LB: I can probably support that feeling quantitatively. A square mile with 100 
people means each person occupies a square whose side is l/10th of a mile or 
160 meters. Expressed in this form, does the population density feel okay? 

RB: Yes, the large open spaces in the western states probably compensate for the 
denser regions near the coasts. 

LB: Now I will lower the estimate by factors of 3 or 10 until you object strongly 
that the estimate feels too low. [A factor of 3 is roughly one-half of a factor of 10, 
because 3 x 3 «; 10. A factor of 3 is the next-smallest factor by which to move 
when a factor of 10 is too large a jump.] In that vein, what about an average 
population density of 10 people per square mile? 



US population 



population 

density 



width height 
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RB: I feel uneasy. The estimate feels a bit low. 

LB: I understand where you are coming from. That value may moderately over- 
estimate the population density of farmland, but it probably greatly underesti- 
mates the population density in the cities. Because you are uneasy, let's move 
more slowly until you object strongly. How about 3 people per square mile? 

RB: If the true value were lower than that, I'd feel fairly surprised. 

LB: So, for the low end. I'll stop at 3 people per square mile. Now let's navigate 
to the upper end. You said that 100 people per square mile felt plausible. How 
do you feel about 300 people per square mile? 

RB: I feel quite uneasy. That estimate feels quite high. 

LB: I hear you. Your response reminds me that New Jersey and the Netherlands, 
both very densely populated, are at 1000 people per square mile, although I 
couldn't swear to this value. I cannot imagine packing the whole United States 
to a density comparable to New Jersey's. Therefore, let's stop here: Our upper 
endpoint is 300 people per square mile. 

How do you make your best guess based on these two endpoints? 

A plausible guess is to use their arithmetic mean, which is roughly 150 peo- 
ple per square mile. However, the right method is the geometric mean: 

best guess = flower endpoint x upper endpoint. (1.17) 

The geometric mean is the midpoint of the lower and upper bounds — but 
on a ratio or logarithmic scale, which is the scale built into our mental hard- 
ware. (For more about how we perceive quantity, see The Number Sense 
[9].) The geometric mean is the correct mean when combining quantities 
produced by our mental hardware. 

Here, the geometric mean is 30 people per square mile: a factor of 10 re- 
moved from either endpoint. Using that population density, 

US population ~ 6 x 10 6 jaHeS 2 ' x — ^ ~ 2x 10 8 . (1.18) 

F y jnileS 2 ' 

The actual population is roughly 3xl0 8 . The estimate based almost entirely 
on gut reasoning is within a factor of 1.5 of the actual population — a pleas- 
antly surprising accuracy. 

Problem 1.8 More gut estimates 

By asking your gut to help you estimate the lower and upper endpoints, estimate 
(a) the height of a nearby tall tree that you can see, (b) the mass of a car, and (c) the 
number of water drops in a bathtub. 
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1.7 Physical estimates 

Your gut understands not only the social world but also the physical world. 
If you trust its feelings, you can tap this vast reservoir of knowledge. For 
practice, we'll estimate the salinity of seawater (Section 1.7.1), human power 
output (Section 1.7.2), and the heat of vaporization of water (Section 1.7.3). 

1.7.1 Salinity of seawater 

To estimate the salinity of seawater, which will later help you estimate the 
conductivity of seawater (Problem 8.10), do not ask your gut directly: "How 
do you feel about, say, 200 millimolar?" Although that kind of question 
worked for estimating population density (Section 1.6), here, unless you 
are a chemist, the answer will be: "I have no clue. What is a millimolar 
anyway? I have almost no experience of that unit." Instead, offer your gut 
concrete data — for example, from a home experiment: adding salt to a cup 
of water until the mixture tastes as salty as the ocean. 

This experiment can be a thought or a real experiment — another example 
of using multiple methods (Section 1.5). As a thought experiment, I ask 
my gut about various amounts of salt in a cup of water. When I propose 
adding 2 teaspoons, it responds, "Disgustingly salty!" At the lower end, 
when I propose adding 0.5 teaspoons, it responds, "Not very salty." I'll use 
0.5 and 2 teaspoons as the lower and upper endpoints of the range. Their 
midpoint, the estimate from the thought experiment, is 1 teaspoon per cup. 

I tested this prediction at the kitchen sink. With 1 teaspoon (5 milliliters) of 
salt, the cup of water indeed had the sharp, metallic taste of seawater that 
I have gulped after being knocked over by large waves. A cup of water is 
roughly one-fourth of a liter or 250 cubic centimeters. By mass, the resulting 
salt concentration is the following product: 

T-tsp-saif JLcup-watgf 5.xmWa1f 2 g salt 

X X X (1.19) 

ILcup water 250 g water jLtsp-saif 1 xnhsatf. 

i^salt 

The density of 2 grams per cubic centimeter comes from my gut feeling that 
salt is a light rock, so it should be somewhat denser than water at 1 gram 
per cubic centimeter, but not too much denser. (For an alternative method, 
more accurate but more elaborate, try Problem 1.10.) Then doing the arith- 
metic gives a 4 percent salt-to-water ratio (by mass). 
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The actual salinity of the Earth's oceans is about 3.5 percent — very close to 
the estimate of 4 percent. The estimate is close despite the large number 
of assumptions and approximations — the errors have mostly canceled. Its 
accuracy should give you courage to perform home experiments whenever 
you need data for divide-and-conquer estimates. 

Problem 1.9 Density of water 

Estimate the density of water by asking your gut to estimate the mass of water in 
a cup measure (roughly one-quarter of a liter). 

Problem 1.10 Density of salt 

Estimate the density of salt using the volume and mass of a typical salt container 
that you find in a grocery store. This value should be more accurate than my gut 
estimate in Section 1.7.1 (which was 2 grams per cubic centimeter). 



1.7.2 Human power output 



Our second example of talking to your gut is an estimate of hu- 
man power output — a power that is useful in many estimates 
(for example. Problem 1.17). Energies and powers are good can- 
didates for divide-and-conquer estimates, because they are con- 
nected by the subdivision shown in the following equation and 
represented in the tree in the margin: 

energy 

power = . 

time 



power 




energy 



( 1 . 20 ) 



In particular, let's estimate the power that a trained athlete can generate for 
an extended time (not just during a few-seconds-long, high-power burst). 
As a proxy for that power. I'll use my own burst power output with two 
adjustment factors: 



athlete's steady power 
^ I 



my steady power athlete s steady power 

my burst power 

my burst power my steady power 




Maintaining a power is harder than producing a quick burst. Therefore, 
the first adjustment factor, my steady power divided by my burst power, 
is somewhat smaller than 1 — maybe 1 /2 or 1 /3. In contrast, an athlete's 



l 

\ 

time 
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power output will be higher than mine, perhaps by a factor of 2 or 3: Even 
though I am sometimes known as the street-fighting mathematician [33], I 
am no athlete. Then the two adjustment factors roughly cancel, so my burst 
power should be comparable to an athlete's steady power. 



To estimate my burst power, I performed a home experiment 
of running up a flight of stairs as quickly as possible. Deter- 
mining the power output requires estimating an energy and 
a time: 

energy 

power = — . (1.21) 

time 

The energy, which is the change in my gravitational potential 
energy, itself subdivides into three factors: 

energy = mass x gravity x height. (1.22) 

m g h 



power 




energy time 




m g h 



In the academic building at my university, a building with high 
ceilings and staircases, I bounded up a staircase three stairs at a 
time. The staircase was about 12 feet or 3.5 meters high. There- 
fore, my mechanical energy output was roughly 2000 joules: 

E ~ 65 kg x 10 ms -2 x 3.5 m ~ 2000 J. (1.23) 




3.5 m 



(The units are fine: 1 J = 1 kg m 2 s 2 .) 



The remaining leaf is the time: how long the climb took me. I made it in 
6 seconds. In contrast, several students made it in 3.9 seconds — the power 
of youth! My mechanical power output was about 2000 joules per 6 sec- 
onds, or about 300 watts. (To check whether the estimate is reasonable, try 
Problem 1.12, where you estimate the typical human basal metabolism.) 



This burst power output should be close to the sustained power output of 
a trained athlete. And it is. As an example, in the Alpe d'Huez climb in the 
1989 Tour de France, the winner — Greg LeMond, a world-class athlete — put 
out 394 watts (over a 42.5-minute period). The cyclist Lance Armstrong, 
during the time-trial stage during the Tour de France in 2004, generated 
even more: 495 watts (roughly 7 watts per kilogram). However, he pub- 
licly admitted to blood doping to enhance performance. Indeed, because 
of widespread doping, many cycling power outputs of the 1990s and 2000s 
are suspect; 400 watts stands as a legitimate world-class sustained power 
output. 
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Problem 1.11 Energy in a 9-volt battery 

Estimate the energy in a 9-volt battery. Is it enough to launch the battery into orbit? 

Problem 1.12 Basal metabolism 

Based on our daily caloric consumption, estimate the human basal metabolism. 

Problem 1.13 Energy measured in person flights of stairs 

How many flights of stairs can you climb using the energy in a stick (100 grams) 
of butter? 



1.7.3 Heat of vaporization of water 

Our final physical estimate concerns the most important liquid on Earth. 

What is the heat of vaporization of water? 



Because water covers so much of the Earth and is such an important part 
of the atmosphere (clouds!), its heat of vaporization strongly affects our 
climate — whether through rainfall (Section 3.4.3) or air temperatures. 

Heat of vaporization is defined as a ratio: heat of 

, vaporization 

energy to evaporate a substance 
amount of the substance 

where the amount of substance can be measured 
in moles, by volume, or (most commonly) by mass. 

The definition provides the structure of the tree a substance substance 

and of the estimate based on divide-and-conquer 

reasoning. 

For the mass of the substance, choose an amount of water that is easy to 
imagine — ideally, an amount familiar from everyday life. Then your gut 
can help you make estimates. Because I often boil a few cups of water at a 
time, and each cup is few hundred milliliters. I'll imagine 1 liter or 1 kilo- 
gram of water. 

The other leaf, the required energy, requires more thought. There is a com- 
mon confusion about this energy that is worth discussing. 

Is it the energy required to bring the water to a boil? 




energy to evaporate mass of the 



No: The energy has nothing to do with the energy required to bring the 
water to a boil! That energy is related to water's specific heat c p . The heat of 
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burner 

power 



vaporization depends on the energy needed to evaporate — boil away — the 
water, once it is boiling. (You compare these energies in Problem 5.61.) 

Energy subdivides into power times time (as when we es- 
timated human power output in Section 1.7.2). Here, the 
power could be the power output of one burner; the time 
is the time to boil away the liter of water. To estimate these 
leaves, let's hold a gut conference. 

For the time, my dialogue is as follows. 

LB: How does 1 minute sound as a lower bound? 

RB: Way too short — you've left boiling water on the stove 
unattended for longer without its boiling away! 

LB: How about 3 minutes? 

RB: That's on the low side. Maybe that's the lower bound. 

LB: Okay. For the upper bound, how about 100 minutes? 

RB: That time feels way too long. Haven't we boiled away pots of water in far 
less time? 

LB: What about 30 minutes? 

RB: That's long, but I wouldn't be shocked, only fairly surprised, if it took that 
long. It feels like the upper bound. 

My range is therefore 3. ..30 minutes. Its midpoint — the geometric mean of 
the endpoints — is about 10 minutes or 600 seconds. 

For variety, let's directly estimate the burner power, without estimating 
lower and upper bounds. 

LB: How does 100 watts feel? 

RB: Way too low: That's a lightbulb! If a lightbulb could boil away water so 
quickly, our energy troubles would be solved. 

LB (feeling chastened): How about 1000 watts (1 kilowatt)? 

RB: That's a bit low. A small appliance, such as a clothes iron, is already 1 kilo- 
watt. 

LB (raising the guess more slowly): What about 3 kilowatts? 

RB: That burner power feels plausible. 

Let's check this power estimate by subdividing power into two factors, volt- 
age and current: 

power = voltage x current. 



heat of 
vaporization 



evaporation 

time 



(1.25) 
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An electric stove requires a line voltage of 220 volts, even in the United 
States where most other appliances require only 110 volts. A standard fuse 
is about 15 amperes, which gives us an idea of a large current. If a burner 
corresponds to a standard fuse, a burner supplies roughly 3 kilowatts: 

220 V x 15 A * 3000 W. (1.26) 



This estimate agrees with the gut estimate, so both methods gain plausibil- 
ity — which should give you confidence to use both methods for your own 
estimates. As a check, I looked at the circuit breaker connected to my range, 
and it is rated for 50 amperes. The range has four burners and an oven, so 
15 amperes for one burner (at least, for the large burner) is plausible. 



We now have values for all the leaf nodes. Prop- 
agating the values toward the root gives the heat 
of vaporization (L vap ) as roughly 2 megajoules per 
kilogram: 



L 



vap 



power time 

3kW x 600s 



1 kg 

mass 

2 x 10 6 Jkg -1 . 



(1.27) 



The true value is about 2.2xl0 6 joules per kilogram. 
This value is one of the highest heats of vaporiza- 
tion of any liquid. As water evaporates, it carries 
away significant amounts of energy, making it an 
excellent coolant (Problem 1.17). 



heat of vaporization 
2 x 10 6 J 
k g 




energy mass 

2 x 10 6 J 1kg 




burner power evaporation time 

3kW 10 min 



1.8 Summary and further problems 

The main lesson that you should take away is courage: No problem is too 
difficult. We just use divide-and-conquer reasoning to dissolve difficult 
problems into smaller pieces. (For extensive practice, see the varied exam- 
ples in the Guesstimation books [47 and 48].) This tool is a universal solvent 
for problems social and scientific. 

Problem 1.14 Per-capita land area 

Estimate the land area per person for the world, for your home country, and for 
your home state or province. 
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Problem 1.15 Mass of the Earth 

Estimate the mass of the Earth. Then look it up (p. xvii) to check your estimate. 

Problem 1.16 Billion 

How long would it take to count to a billion (10 9 )? 

Problem 1.17 Sweating 

Estimate how much water you need to drink to replace water lost to evapora- 
tion, if you ride a bicycle vigorously for 1 hour. Represent your estimate as a 
divide-and-conquer tree. Hint: Humans are only about 25 percent efficient in gen- 
erating mechanical work. 

Problem 1.18 Pencil line 

How long a line can you write with a pencil? 

Problem 1.19 Pine needles 

Estimate the number of needles on a pine tree. 

Problem 1.20 Hairs 

How many hairs are on your head? 
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Divide-and-conquer reasoning, the tool introduced in Chapter 1, is power- 
ful, but it is not enough by itself to organize the complexity of the world. 
Try, for example, to manage the millions of files on a computer — even my 
laptop says that it has almost 3 million files. Without any organization, with 
all the files in one monster directory or folder, you could never find informa- 
tion that you need. However, simply using divide and conquer by dividing 
the files into groups — the first 100 files by date, the second 100 files by date, 
and so on — does not disperse the chaos. A better solution is to organize the 
millions of files into a hierarchy: as a tree of folders and subfolders. The ele- 
ments in this hierarchy get names — for example, "photos of the children" or 
"files for typesetting this book" — and these names guide us to the needed 
information. 

Naming — or, more technically, abstraction — is our other tool for organizing 
complexity. A name or an abstraction gets its power from its reusability. 
Without reusable ideas, the world would become unmanageably compli- 
cated. We might ask, "Could you, without tipping it over, move the wooden 
board glued to four thick sticks toward the large white plastic circle?" in- 
stead of, "Could you slide the chair toward the table?" The abstractions 
"chair," "slide," and "table" compactly represent complex ideas and physi- 
cal structures. (And even the complex question itself uses abstractions.) 
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2 Abstraction 



Similarly, without good abstractions we could hardly calculate, and mod- 
ern science and technology would be impossible. As an illustration, imag- 
ine the pain of the following calculation: 

XXVII x XXXVI, (2.1) 

which is 27 x 36 in Roman numerals. The problem is not that the notation is 
unfamiliar, but rather that it is not based on abstractions useful for calcula- 
tion. Not least, it does not lend itself to divide-and-conquer reasoning; for 
example, even though V (5) is a part of XXVII, V x XXXVI has no obvious an- 
swer. In contrast, our modern number system, based on the abstractions of 
place value and zero, makes the whole multiplication simple. Notations are 
abstractions, and good abstractions amplify our intelligence. In this chap- 
ter, we will practice making abstractions, discuss their high-level purpose, 
and continue to practice. 



2.1 Energy from burning hydrocarbons 

Our understanding of the world is built on layers of abstrac- 
tions. Consider the idea of a fluid. At the bottom of the ab- 
straction hierarchy are the actors of particle physics: quarks 
and electrons. Quarks combine to build protons and neu- 
trons. Protons, neutrons, and electrons combine to build 
atoms. Atoms combine to build molecules. And large collec- 
tions of molecules act, under many conditions, like a fluid. 

The idea of a fluid is a new unit of thought. It helps us un- 
derstand diverse phenomena, without our having to calcu- 
late or even know how quarks and electrons interact to pro- 
duce fluid behavior. As one consequence, we can describe 
the behavior of air and water using the same equations (the 
Navier-Stokes equations of fluid mechanics); we need only 
to use different values for the density and viscosity. Then 
atmospheric cyclones and water vortices, although they result from widely 
differing sets of quarks and electrons and their interactions, can be under- 
stood as the same phenomenon. 

A similarly powerful abstraction is a chemical bond. We'll use this abstrac- 
tion to estimate a quantity essential to our bodies and to modern society: 
the energy released by burning chains made of hydrogen and carbon atoms 
(hydrocarbons). A hydrocarbon can be abstracted as a chain of CH 2 units: 



fluid 



molecules 



atoms 

\ 

protons, 
neutrons 



quarks 




electrons 
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Burning a CH 2 unit requires oxygen (0 2 ) and releases carbon dioxide (C0 2 ), 
water, and energy: 

CH 2 + fo 2 — ♦ C0 2 + H 2 0 + energy. (2.2) 

For a hydrocarbon with eight carbons — such as octane, a prime component 
of motor fuel — simply multiply this reaction by 8: 

(CH 2 ) 8 + 12 0 2 — ♦ 8 C0 2 + 8 H 2 0 + lots of energy. (2.3) 

(The two additional hydrogens at the left and right ends of octane are not 
worth worrying about.) 

How much energy is released by burning one CH Z unit? 



To make this estimate, use the table of bond 
energies. It gives the energy required to break 
(not make) a chemical bond — for example, be- 
tween carbon and hydrogen. However, there 
is no unique carbon-hydrogen (C-H) bond. 

The carbon-hydrogen bonds in methane are 
different from the carbon-hydrogen bonds in 
ethane. To make a reusable idea, we neglect 
those differences — placing them below our 
abstraction barrier — and make an abstraction 
called the carbon-hydrogen bond. So the ta- 
ble, already in its first column, is built on an 
abstraction. 

The second gives the bond energy in kilo- 
calories per mole of bonds. A kilocalorie is 
roughly 4000 joules, and a mole is Avogadro's 

number (6xl0 23 ) of bonds. The third column gives the energy in the SI units 
used by most of the world, kilojoules per mole. The final column gives the 
energy in electron volts (eV) per bond. An electron volt is 1.6 x 10 -19 joules. 
An electron volt is suited for measuring atomic energies, because most bond 
energies have an easy-to-grasp value of a few electron volts. I wish most of 
the world used this unit! 
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hydrocarbon unit. 






bond 

( kcal \ 
\ mol / 


energy 

m 

\ mol J 


1 xC— C 


1 x 83 


1 x 347 


2 xC— H 


2 x 99 


2 x 414 


1.5 x 0=0 


1.5 x 119 


1.5 x 498 


Total 


460 


1925 



bond energy 



The left side of the reaction has two carbon-hy- 
drogen bonds, 1.5 oxygen-oxygen double bonds, 
and one carbon-carbon bond (connecting the car- 
bon atom in the CH 2 unit to the carbon atom in 
a neighboring unit). The total, 460 kilocalories or 

1925 kilojoules per mole, is the energy required to break the bonds. It is an 
energy input, so it reduces the net combustion energy. 

The right side has two carbon-oxygen double bonds 
and two oxygen-hydrogen bonds. The total for the 
right side, 606 kilocalories or 2535 kilojoules per mole, 
is the energy released in forming these bonds. It is the 
energy produced, so it increases the net combustion 
energy. 

The net result is, per mole of CH 2 , an energy release of 
606 minus 460 kilocalories, or approximately 145 kilo- 
calories (610 kilojoules). Equivalently, it is also about 6 electron volts per 
CH 2 unit — about 1.5 chemical bonds' worth of energy. The combustion en- 
ergy is also useful as an energy per mass rather than per mole. A mole of 
CH 2 units weighs 14 grams. Therefore, 145 kilocalories per mole is roughly 
10 kilocalories or 40 kilojoules per gram. This energy density is worth mem- 
orizing because it gives the energy released by burning oil and gasoline or 
by metabolizing fat (even though fat is not a pure hydrocarbon). 
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hydrogen (H 2 ) 
methane (CH 4 ) 
gasoline (C 8 H 18 ) 
stearic acid (C 18 H 36 0 2 ) 
glucose (C 6 H 12 O s ) 



combustion energy 
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The preceding table, adapted from Oxford University's "Virtual Chemistry" 
site, gives actual combustion energies for plant and animal fuel sources 
(with pure hydrogen included for fun). The penultimate entry, stearic acid, 
is a large component of animal fat; animals store energy in a substance with 
an energy density comparable to the energy density in gasoline — roughly 
10 kilocalories or 40 kilojoules per gram. Plants, on the other hand, store 
energy in starch, which is a chain of glucose units; glucose has an energy 
density of only roughly 4 kilocalories per gram. This value, the energy den- 
sity of food carbohydrates (sugars and starches), is also worth memorizing. 
It is significantly lower than the energy density of fats: Eating fat fills us up 
much faster than eating starch does. 

How can we explain the different plant and animal energy-storage densities? 

Plants do not need to move, so the extra weight required by using lower-den- 
sity energy storage is not so important. The benefit of the simpler glucose 
metabolic pathway outweighs the drawback of the extra weight. For ani- 
mals, however, the large benefit of lower weight outweighs the metabolic 
complexity of burning fats. 

Problem 2.1 Estimating the energy density of common foods 

In American schools, the traditional lunch is the peanut-butter-and-jelly sandwich. 
Estimate the energy density in peanut butter and in jelly (or jam). 

Problem 2.2 Peanut butter as fuel 

If you could convert all the combustion energy in one tablespoon (15 grams) of 
peanut butter into mechanical work, how many flights of stairs could you climb? 

Problem 2.3 Growth of grass 

How fast does grass grow? Is the rate limited by rainfall or by sunlight? 

2.2 Coin-flip game 

The abstractions of atoms, bonds, and bond energies have been made for us 
by the development of science. But we often have to make new abstractions. 
To develop this skill, we'll analyze a coin game where two players take turns 
flipping a (fair) coin; whoever first tosses heads wins. 

What is the probability that the first player wins? 

First get a feel for the game by playing it. Here is one round: TH. The first 
player missed the chance to win by tossing tails (T); but the second player 
tossed heads (H) and won. 
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Playing many games might reveal a pattern to us or suggest how to com- 2 TH 
pute the probability. However, playing many games by flipping a real 2 TH 
coin becomes tedious. Instead, a computer can simulate the games, sub- 1 H 
stituting pseudorandom numbers for a real coin. Here are several runs 2 TH 
produced by a computer program. Each line begins with 1 or 2 to indicate 1 TTH 
which player won the game; the rest of the line shows the coin tosses. In 2 TTTH 
these ten iterations, each player won five times. A reasonable conjecture 2 TH 
is that each player has an equal chance to win. However, this conjecture, l H 
based on only ten games, cannot be believed too strongly. 1 h 

Let's try 100 games. Now even counting the wins becomes tedious. My 1 H 
computer counted for me: 68 wins for player 1, and 32 wins for player 2. 

The probability of player l's winning now seems closer to 2/3 than to 1/2. 



To find the exact value, let's diagram the game as a tree re- 
flecting the alternative endings of the game. Each layer rep- 
resents one flip. The game ends at a leaf, when one player 
has tossed heads. The shaded leaves show the first player's 
wins — for example, after H, TTH, or TTTTH. The probabili- 
ties of these winning ways are 1 / 2 (for H), 1 /8 (for TTH), and 
1/32 (for TTTTH). The sum of all these winning probabilities 
is the probability of the first player's winning: 



start 




1 1 1 
2 + 8 + 32 + 



(2.5) 



To sum this infinite series without resorting to formulas, make 
an abstraction: Notice that the tree contains, one level down, 
a near copy of itself. (In this problem, the abstraction gets 
reused within the same problem. In computer science, such a 
structure is called recursive.) For if the first player tosses tails, 
the second player starts the game in the position of the first 
player, with the same probability of winning. 




To benefit from this equivalence, let's name the reusable idea, namely the 
probability of the first player's winning, and call it p. The second player 
wins the game with probability p/2: The factor of 1/2 is the probability 
that the first player tosses tails; the factor of p is the probability that the 
second player wins, given that the first player blew his chance by tossing 
tails on the first toss. 



Because either the first or the second player wins, the two winning proba- 
bilities add to 1: 
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p + p/2 = 1. (2.6) 

P (first player wins) P (second player wins) 

The solution is p = 2/3, as suggested by the 100-game simulation. The ben- 
efit of the abstraction solution, compared to calculating the infinite proba- 
bility sum explicitly, is insight. In the abstraction solution, the answer has 
to be what it is. It leaves almost nothing to remember. An amusing illustra- 
tion of the same benefit comes from the problem of the fly that zooms back 
and forth between two approaching trains. 

If the fly starts when the trains are 60 miles apart, each train travels at 20 miles per 
hour, and the fly travels at 30 miles per hour, how far does the fly travel, in total, 
before meeting its maker when the trains collide? (Apologies that physics problems 
are often so violent.) 



Right after hearing the problem, John von Neumann, inventor of game the- 
ory and the modern computer, gave the correct distance. "That was quick," 
said a colleague. "Everyone else tries to sum the infinite series." "What's 
wrong with that?" said von Neumann. "That's how I did it." In Problem 2.7, 
you get to work out the infinite-series and the insightful solutions. 



Problem 2.4 Summing a geometric series using abstraction 

Use abstraction to find the sum of the infinite geometric series 

1 + r + r 2 + r 3 + (2.7) 



Problem 2.5 Using the geometric-series sum 

Use Problem 2.4 to check that the probability of the first player's winning is 2/3: 
111 2 

+ ^ + (2-! 



Problem 2.6 Nested square roots 

Evaluate these infinite mixes of arithmetic and square roots: 



y 3 x ^3x 

-\J 2 + ^2 + 




(2.9) 

(2.10) 



Problem 2.7 Two trains and a fly 

Find the insightful and the infinite-series solution to the problem of the fly and 
the approaching trains (Section 2.2). Check that they give the same answer for the 
distance that the fly travels! 
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Problem 2.8 Resistive ladder 

In the following infinite ladder of 1-ohm resistors, what is the resistance between 
points A and B? This measurement is indicated by the ohmmeter connected be- 
tween these points. 



ID ID lO IQ IQ lO 




2.3 Purpose of abstraction 

The coin game (Section 2.2), like the geometric series (Problem 2.4) or the 
resistive ladder (Problem 2.8), contained a copy of itself. Noticing this reuse 
greatly simplified the analysis. Abstraction has a second benefit: giving 
us a high-level view of a problem or situation. Abstractions then show us 
structural similarities between seemingly disparate situations. 

As an example, let's revisit the geometric mean, introduced in Section 1.6 
to make gut estimates. The geometric mean of two nonnegative quantities 
n and b is defined as 

geometric mean = Job . (2.11) 

This mean is called the geometric mean because it has 
a pleasing geometric construction. Divide the diameter 
of a circle into two lengths, a and b, and inscribe a right 
triangle whose hypotenuse is the diameter. The triangle's 
altitude is the geometric mean of a and b. 

This mean reappears in surprising places, including the 
beach. When you stand at the shore and look at the horizon, you are seeing 
a geometric mean. The distance to the horizon is the geometric mean of 
two important lengths in the problem (Problem 2.9). 

For me, its most surprising appearance was in the "Programming and Prob- 
lem-Solving Seminar" course taught by Donald Knuth [40] (who also cre- 
ated TpX, the typesetting system for this book). The course, taught as a se- 
ries of two-week problems, helped first-year PhD students transition from 
undergraduate homework problems to PhD research problems. A home- 
work problem requires perhaps 1 hour. A research problem requires, say. 
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1000 hours: roughly a year of work, allowing for other projects. (A few prob- 
lems stapled together become a PhD.) In the course, each 2-week module re- 
quired about 30 hours — approximately the geometric mean of the two end- 
points. The modules were just the right length to help us cross the bridge 
from homework to research. 



Problem 2.9 Horizon distance 

How far is the horizon when you are standing at the shore? Hint: It's farther for 
an adult than for a child. 



Problem 2.10 Distance to a ship 

Standing at the shore, you see a ship (drawn to scale) with a 10-me- 
ter mast sail into the distance and disappear from view. How far 
away was it when it disappeared? 
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As further evidence that the geometric mean is a useful abstraction, the idea 
appears even when there is no geometric construction to produce it, such 
as in making gut estimates. We used this method in Section 1.6 to estimate 
the population density and then the population of the United States. Let's 
practice by estimating the oil imports of the United States in barrels per 
year — without the divide-and-conquer reasoning of Section 1.4. 

The method requires that the gut supply a lower and an upper bound. My 
gut reports back that it would feel fairly surprised if the imports were less 
than 10 million barrels per year. On the upper end, my gut would be fairly 
surprised if the imports were higher than 1 trillion barrels per year — a bar- 
rel is a lot of oil, and a trillion is a large number! 

You might wonder how your gut too can come up with such large numbers 
and how you can have any confidence in them. Admittedly, I have practiced 
a lot. But you can practice too. The key is the practice effectively. First, have 
the courage to guess even when you feel anxious about it (I feel this anxiety 
still, so I practice this courage often). Second, compare your guess to values 
in which you can place more confidence — for example, to your own more 
careful estimates or to official values. The comparison helps calibrate your 
gut (your right brain) to these large magnitudes. You will find a growing 
and justified confidence in your judgment of magnitude. 

My best guess for the amount is the geometric mean of the lower and upper 
estimates: 

V 10 million x 1 trillion barrels (2.12) 



year 
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The result is roughly 3 billion barrels per year — close to the our estimate 
using divide and conquer and close to the true value. In contrast, the arith- 
metic mean would have produced an estimate of 500 billion barrels per year, 
which is far too high. 

Problem 2.11 Arithmetic-mean-geometric-mean inequality 

Use the geometric construction for the geometric mean to show that the arithmetic 
mean of a and b (assumed to be nonnegative) is always greater than or equal to 
their geometric mean. When are the means equal? 

Problem 2.12 Weighted geometric mean 

A generalization of the arithmetic mean of a and b as (a + b)/2 is to give a and 
b unequal weights. What is the analogous generalization for a geometric mean? 
(The weighted geometric mean shows up in Problem 6.29 when you estimate the 
contact time of a ball bouncing from a table.) 



2.4 Analogies 

Because abstractions are so useful, it is helpful to have methods for making 
them. One way is to construct an analogy between two systems. Each com- 
mon feature leads to an abstraction; each abstraction connects our knowl- 
edge in one system to our knowledge in the other system. One piece of 
knowledge does double duty. Like a mental lever, analogy and, more gen- 
erally, abstraction are intelligence amplifiers. 



2.4.1 Electrical-mechanical analogies 



An illustration with many abstractions on which we can practice is the anal- 
ogy between a spring-mass system and an inductor-capacitor (LC) circuit. 

L 



-mm— 






(2.13) 



In the circuit, the voltage source — the V in on its left side — supplies a cur- 
rent that flows through the inductor (a wire wrapped around an iron rod) 
and capacitor (two metal plates separated by air). As current flows through 
the capacitor, it alters the charge on the capacitor. This "charge" is confus- 
ingly named, because the net charge on the capacitor remains zero. Instead, 
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"charge" means that the two plates of the capacitor hold opposite charges, 
Q and — Q, with Q A 0. The current changes Q. The charges on the two 
plates create an electric field, which produces the output voltage V out equal 
to Q/C (where C is the capacitance). 

For most of us, the circuit is less familiar than the spring-mass system. 
However, by building an analogy between the systems, we transfer our un- 
derstanding of the mechanical to the electrical system. 

In the mechanical system, the fundamental variable 
is the mass's displacement x. In the electrical sys- 
tem, it is the charge Q on the capacitor. These vari- 
ables are analogous so their derivatives should also 
be analogous: Velocity {v), the derivative of posi- 
tion, should be analogous to current (I), the derivative of charge. 



spring 

variable x 

1st derivative v 

2nd derivative a 



Let's build more analogy bridges. The derivative of velocity, which is the 
second derivative of position, is acceleration (a). Therefore, the derivative 
of current (dl/dt) is the analog of acceleration. This analogy will be useful 
shortly when we find the circuit's oscillation frequency. 



These variables describe the state of the systems and how that state changes: 
They are the kinematics. But without the causes of the motion — the dy- 
namics — the systems remain lifeless. In the mechanical system, dynamics 
results from force, which produces acceleration: 



a = 



F 
m ' 



(2.14) 



Acceleration is analogous to change in current dl/dt, which is produced by 
applying a voltage to the inductor. For an inductor, the governing relation 
(analogous to Ohm's law for a resistor) is 



d[ _ V 
dt L ' 



(2.15) 



where L is the inductance, and V is the voltage across the inductor. Based 
on the common structure of the two relations, force F and voltage V must 
be analogous. Indeed, they both measure effort: Force tries to accelerate 
the mass, and voltage tries to change the inductor current. Similarly, mass 
and inductance are analogous: Both measure resistance to the correspond- 
ing effort. Large masses are hard to accelerate, and large-L inductors resist 
changes to their current. (A mass and an inductor, in another similarity, 
both represent kinetic energy: a mass through its motion, and an inductor 
through the kinetic energy of the electrons making its magnetic field.) 



circuit 

Q 

I 

dl/dt 
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Turning from the mass-inductor analogy, let's look at the spring-capacitor 
analogy. These components represent the potential energy in the system: 
in the spring through the energy in its compression or expansion, and in 
the capacitor through the electrostatic potential energy due to its charge. 



Force tries to stretch the spring but meets a resistance k: The stiffer the 
spring (the larger its k), the harder it is to stretch. 




Analogously, voltage tries to charge the capacitor but meets a resistance 
1 /C: The larger the value of 1 /C, the smaller the resulting charge. 




(2.17) 



Based on the common structure of the relations for x and Q, spring constant 
k must be analogous to inverse capacitance 1/C. Here are all our analogies. 



mechanical electrical 



kinematics 

fundamental variable x Q 

first derivative v I 

second derivative a dl/dt 

dynamics 

effort F V 

resistance to effort (kinetic component) m L 

resistance to effort (potential component) k 1/C 



From this table, we can read off our key result. Start with the natural (an- 
gular) frequency a; of a spring-mass system: to = A jk/m . Then apply the 
analogies. Mass m is analogous to inductance L. Spring constant k is anal- 
ogous to inverse capacitance 1/C. Therefore, co for the LC circuit is 1/{LC : 



co = 




1 

{LC‘ 



(2.18) 



Because of the analogy bridges, one formula, the natural frequency of a 
spring-mass system, does double duty. More generally, whatever we learn 
about one system helps us understand the other system. Because of the 
analogies, each piece of knowledge does double duty. 
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2.4.2 Energy density in the gravitational field 



With the electrical-mechanical analogy as practice, let's try a less famil- 
iar analogy: between the electric and the gravitational field. In particular, 
we'll connect the energy densities (energy per volume) in the correspond- 
ing fields. An electric field E represents an energy density of e 0 E l /2, where 
6 q is the permittivity of free space appearing in the electrostatic force be- 
tween two charges and q 2 : 



4ne 0 r 2 



( 2 . 19 ) 



Because electrostatic and gravitational forces are both inverse-square forces 
(the force is proportional to 1 /r 2 ), the energy densities should be analogous. 
Not least, there should be a gravitational energy density. But how is it re- 
lated to the gravitational field? 

To answer that question, our first step is to find the gravitational analog of 
the electric field. Rather than thinking of the electric field only as something 
electric, focus on the common idea of a field. In that sense, the electric field 
is the object that, when multiplied by the charge, gives the force: 

force = charge x field. (2.20) 



We use words rather than the normal symbols, such as E for field or q for 
charge, because the symbols might bind our thinking to particular cases 
and prevent us from climbing the abstraction ladder. 



This verbal form prompts us to ask: What is gravitational charge? In elec- 
trostatics, charge is the source of the field. In gravitation, the source of the 
field is mass. Therefore, gravitational charge is mass. Because field is force 
per charge, the gravitational field strength is an acceleration: 



gravitational field = 



force 

charge 



forrp 

= acceleration, 

mass 



( 2 . 21 ) 



Indeed, at the surface of the Earth, the field strength is g, also called the 
acceleration due to gravity. 

The definition of gravitational field is the first half of the puzzle (we are 
using divide-and-conquer reasoning again). For the second half, we'll use 
the field to compute the energy density. To do so, let's revisit the route from 
electric field to electrostatic energy density: 

E - \e 0 E 2 . 



( 2 . 22 ) 
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With g as the gravitational field, the analogous route is 

y 

g -> - x something x g 2 , (2.23) 

where the "something" represents our ignorance of what to do about c () . 
What is the gravitational equivalent ofe 0 ? 



To find its equivalent, compare the simplest case in both worlds: the field 
of a point charge. A point electric charge q produces a field 



E = -J—-1 
47T6 0 r 2 



( 2 . 24 ) 



A point gravitational charge m (a point mass) produces a gravitational field 
(an acceleration) 




where G is Newton's constant. 



The gravitational field has a similar structure to the electric field. Both 
are inverse-square forces, as expected. Both are proportional to the charge. 
The difference is the constant of proportionality. For the electric field, it is 
1 /4ttc ( j. For the gravitational field, it is simply G. Therefore, G is analogous 
to 1/471 e 0 ; equivalently, e 0 is analogous to 1/47 tG. 



Then the gravitational energy density becomes 
>2 



1 1 
2 X 4 ttG 



x g 2 = 



g 

8nG' 



( 2 . 26 ) 



We will use this analogy in Section 9.3.3 when we transfer our hard-won 
knowledge of electromagnetic radiation to understand the even more subtle 
physics of gravitational radiation. 

Problem 2.13 Gravitational energy of the Sun 

What is the energy in the gravitational field of the Sun? (fust consider the field 
outside the Sun.) 

Problem 2.14 Pendulum period including buoyancy 

The period of a pendulum in vacuum is (for small amplitudes) T = In^l/g , where 
/ is the bob length and g is the gravitational field strength. Now imagine the pen- 
dulum swinging in a fluid (say, air). By replacing g with a modified value, include 
the effect of buoyancy in the formula for the pendulum period. 
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Problem 2.15 Comparing field energies 

Find the ratio of electrical to gravitational field energies in the fields produced by 
a proton. 



2.4.3 Parallel combination 

Analogies not only reuse work, they help us rewrite expressions in compact, 
insightful forms. An example is the idea of parallel combination. It appears 
in the analysis of the infinite resistive ladder of Problem 2.8. 



the ladder all over again 




To find the resistance R across the ladder (in other words, what the ohmme- 
ter measures between the nodes A and B), you represent the entire ladder 
as a single resistor R. Then the whole ladder is 1 ohm in series with the 
parallel combination of 1 ohm and R: 
in 

-^Nwmh- 

m 





The next step in finding R usually invokes the parallel-resistance formula: 
that the resistance of K | and R 2 in parallel is 



R^R 2 
R\ + R 2 



(2.28) 



For our resistive ladder, the parallel combination of 1 ohm with the ladder 
is 1 ohm x R/(l ohm + R). Placing this combination in series with 1 ohm 
gives a resistance 



1Q + 



lQxR 

in + R' 



(2.29) 



This recursive construction reproduces the ladder, only one unit longer. We 
therefore get an equation for R: 



R = 1Q + 



1 QxR 



in + R' 



(2.30) 
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The (positive) solution is R = (1 + «[5)/2 ohms. The numerical part is the 
golden ratio ([> (approximately 1.618). Thus, the ladder, when built with 
1-ohm resistors, offers a resistance of ([> ohms. 

Although the solution is correct, it skips over a reusable idea: the parallel 
combination. To facilitate its reuse, let's name the idea with a notation: 

Ri II R 2 - (2.31) 

This notation is self-documenting, as long as you recognize the symbol || 
to mean "parallel," a recognition promoted by the parallel bars. A good 
notation should help thinking, not hinder it by requiring us to remember 
how the notation works. With this notation, the equation for the ladder 
resistance R is 



r = in + in ii r 



(2.32) 



(the parallel-combination operator has higher priority than — is computed 
before — the addition). This expression more plainly reflects the structure 
of the system, and our reasoning about it, than does the version 



R = in + 



lQxR 
ICt + R' 



(2.33) 



The || notation organizes the complexity. 

Once you name an idea, you find it everywhere. As a child, after my family 
bought a Volvo, I saw Volvos on every street. Similarly, we'll now look at 
examples of parallel combination far beyond the original appearance of the 
idea in circuits. For example, it gives the spring constant of two connected 
springs (Problem 2.16): 

h b hllb 



Problem 2.16 Springs as capacitors 

Using the analogy between springs and capacitors (discussed in Section 2.4.1), ex- 
plain why springs in series combine using the parallel combination of their spring 
constants. 

Another surprising example is the following spring-mass system with two 
masses: 





k 




m 


[aaa/www 


M 
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The natural frequency to, expressed without our || abstraction, is 

k(m + M) 

co = — 

mM 

This form looks complicated until we use the || abstraction: 
k 

co = . 

m || M 



( 2 . 35 ) 



( 2 . 36 ) 



Now the frequency makes more sense. The two masses act like their parallel 
combination m || M: 



n. 

lA/WVWVW m || M 



The replacement mass m || M is so useful that it has a special name: the re- 
duced mass. Our abstraction organizes complexity by turning a three-com- 
ponent system (a spring and two masses) into a simpler two-component 
system. 

In the spirit of notation that promotes insight, use lowercase ("small") m for 
the mass that is probably smaller, and uppercase ("big") M for the mass that 
is probably larger. Then write m\\ M rather than M || m. These two forms 
produce the same result, but the m || M order minimizes surprise: The 
parallel combination of m and M is smaller than either mass (Problem 2.17), 
so it is closer to m, the smaller mass, than to M. Writing m || M, rather than 
M || m, places the most salient information first. 

Problem 2.17 Using the resistance analogy 

By using the analogy with parallel resistances, explain why m || M is smaller than 
m andM. 

Why do the two masses combine like resistors in parallel? 



The answer lies in the analogy between mass and resistance. Resistance 
appears in Ohm's law: 

voltage = resistance x current. ( 2 . 37 ) 

Voltage is an effort. Current, which results from the effort, is a flow. There- 
fore, the more general form — one step higher on the abstraction ladder — is 

effort = resistance x flow. 

In this form, Newton's second law. 



( 2 . 38 ) 
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force = mass x acceleration, (2.39) 

identifies force as the effort, mass as the resistance, and acceleration as the 
flow. 

Because the spring can wiggle either mass, just as current can flow through 
either of two parallel resistors, the spring feels a resistance equal to the par- 
allel combination of the resistances — namely, m\\M. 

Problem 2.18 Three springs connected 

What is the effective spring constant of three springs connected in a line, with 
spring constants 2, 3, and 6 newtons per meter, respectively? 



2.4.4 Impedance as a higher-level abstraction 

Resistance, in the electrical sense, has appeared several times, and it under- 
lies a higher-level abstraction: impedance. Impedance extends the idea of 
electrical resistance to capacitors and inductors. Capacitors and inductors, 
along with resistors, are the three linear circuit elements: In these elements, 
the connection between current and voltage is described by a linear equa- 
tion: For resistors, it is a linear algebraic relation (Ohm's law); for capacitors 
or inductors, it is a linear differential equation. 

Why should we extend the idea of resistance? 

Resistors are easy to handle. When a circuit contains only resistors, we can 
immediately and completely describe how it behaves. In particular, we can 
write the voltage at any point in the circuit as a linear combination of the 
voltages at the source nodes. If only we could do the same when the circuit 
contains capacitors and inductors. 

We can! Start with Ohm's law, 
voltage 

current = , (2.40) 

resistance 

and look at it in the higher-level and expanded form 
1 

flow = x effort. (2.41) 

resistance 

For a capacitor, flow will still be current. But we'll need to find the capac- 
itive analog of effort. This analogy will turn out slightly different from 
the electrical-mechanical analogy between capacitance and spring constant 
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(Section 2.4.1), because now we are making an analogy between capacitors 
and resistors (and, eventually, inductors). For a capacitor, 

charge = capacitance x voltage. (2.42) 

To turn charge into current, we differentiate both sides to get 

d (voltage) 

current = capacitance x — . (2.43) 

To make the analogy quantitative, let's apply to the capacitor the sim- 
plest voltage whose form is not altered by differentiation: 

V = V 0 ei wt , (2.44) 

where V is the input voltage, V () is the amplitude, co is the angular fre- 
quency, and j is the imaginary unit 4— T. The voltage V is a complex num- 
ber; but the implicit understanding is that the actual voltage is the real part 
of this complex number. By finding how the current I (the flow) depends 
on V (the effort), we will extend the idea of resistance to a capacitor. 

With this exponential form, how can we represent the more familiar oscillating 
voltages V 1 cos cot or V 1 sin cot, where V 1 is a real voltage? 

Start with Euler's relation: 

ei wt = cos cot + j sin cot. (2.45) 

To make V 1 cos cot, set y 0 = Vi in V = V 0 e I(Vt . Then 

V = V 1 (cos cot + j sin cot ) . (2.46) 

and the real part of V is just Vj cos cot. 

Making V 1 sin cot is more tricky. Choosing V () = jV-\ almost works: 

V — ;V-| (cos cot + j sin cot) = V 1 (j coscot — sin cot). (2.47) 

The real part is — Vi sin cot, which is correct except for the minus sign. Thus, 
the correct amplitude is V () = —jV\. In summary, our exponential form can 
compactly represent the more familiar sine and cosine signals. 

With this exponential form, differentiation is simpler than with sines or 
cosines. Differentiating V with respect to time just brings down a factor of 
jco, but otherwise leaves the V 0 e ,wl alone: 

^=jco x e’ wt = jvV. 




v 



(2.48) 
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With this changing voltage, the capacitor equation, 

d(voltage) 

current = capacitance x — , (2.49) 

becomes 

current = capacitance x jcv x voltage. (2.50) 

Let's compare this form to its analog for a resistor (Ohm's law): 

1 

current = x voltage. (2.51) 

resistance 

Matching up the pieces, we find that a capacitor offers a resistance 

Z C = (2.52, 



This more general resistance, which depends on the frequency, is called im- 
pedance and denoted Z. (In the analogy of Section 2.4.1 between capacitors 
and springs, we found that capacitor offered a resistance to being charged of 
1/C. Impedance, the result of an analogy between capacitors and resistors, 
contains 1 /C as well, but also contains the frequency in the 1 /jcv factor.) 



Using impedance, we can describe what happens to any sinusoidal signal 
in a circuit containing capacitors. Our thinking is aided by the compact 
notation — the capacitive impedance Z c (or even R c ). The notation hides 
the details of the capacitor differential equation and allows us to transfer 
our intuition about resistance and flow to a broader class of circuits. 

The simplest circuit with resistors and capacitors is the Vin 

so-called low-pass RC circuit. Not only is it the sim- 

plest interesting circuit, it will also be, thanks to fur- 
ther analogies, a model for heat flow. Let's apply the 

impedance analogy to this circuit. 



To help us make and use abstractions, let's imagine defocusing our 
eyes. Under blurry vision, the capacitor looks like a resistor that just 
happens to have a funny resistance Rq = 1/jcoC. Now the entire cir- 
cuit looks just like a pure-resistance circuit. Indeed, it is the simplest 
such circuit, a voltage divider. Its behavior is described by one num- 
ber: the gain, which is the ratio of output to input voltage V oui / l / ln . 




In the RC circuit, thought of as a voltage divider, 

capacitive resistance R c 

^ am total resistance from V m to ground R + R c 



(2.53) 



^OUt 

ground 

^out 

ground 
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Because R c = 1/jojC, the gain becomes 
gain 



l 

jcvC 



R + j 



( 2 . 54 ) 



ju>C 



After clearing the fractions by multiplying by jcoC in the numerator and 
denominator, the gain simplifies to 

gain = TTJJrc- (255) 

Why is the circuit called a low-pass circuit? 



At high frequencies (co — > oo), the jcoRC term in the denominator makes 
the gain zero. At low frequencies (co — > 0), the jcoRC term disappears and 
the gain is 1. High-frequency signals are attenuated by the circuit; low-fre- 
quency signals pass through mostly unchanged. This abstract, high-level 
description of the circuit helps us understand the circuit without our get- 
ting buried in equations. Soon we will transfer our understanding of this 
circuit to thermal systems. 

The gain contains the circuit parameters as the product RC. In the denom- 
inator of the gain, jcoRC is added to 1; therefore, jcoRC, like 1, must have 
no dimensions. Because j is dimensionless (is a pure number), coRC must 
be itself dimensionless. Therefore, the product RC has dimensions of time. 
This product is the circuit's time constant — usually denoted r. 

The time constant has two physical interpretations. To construct them, we 
imagine charging the capacitor using a constant input voltage V 0 ; eventu- 
ally (after an infinite time), the capacitor charges up to the input voltage 
(Vout = V 0 ) and holds a charge Q = CV 0 . Then, at t — 0, we make the input 
voltage zero by connecting the input to ground. 




R 

iM/W— 

c 



v out 



ground 



t < 0 t > 0 

The capacitor discharges through the resistor, and its voltage decays expo- 
nentially: 
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After one time constant r, the capacitor voltage falls by a factor of e toward 
its final value — here, from V Q to V 0 /e . The 1/e time is our first interpretation 
of the time constant. Furthermore, if the capacitor voltage had decayed at 
its initial rate (just after t = 0), it would have reached zero voltage after one 
time constant r — the second interpretation of the time constant. 

The time-constant abstraction hides — abstracts away — the details that pro- 
duced it: here, electrical resistance and capacitance. Nonelectrical systems 
can also have a time constant but produce it by a different mechanism. 
Our high-level understanding of time constants, because it is not limited 
to electrical systems, will help us transfer our understanding of the electri- 
cal low-pass filter to nonelectrical systems. In particular, we are now ready 
to understand heat flow in thermal systems. 

Problem 2.19 Impedance of an inductor 

An inductor has the voltage-current relation 

,7 r dI 

V = L dt ' (256) 

where L is the inductance. Find an inductor's frequency-dependent impedance 
Z L . After finding this impedance, you can analyze any linear circuit as if it were 
composed only of resistors. 



2.4.5 Thermal systems 

The RC circuit is a model for thermal systems — which are not obviously 
connected to circuits. In a thermal system, temperature difference, the ana- 
log of voltage difference, produces a current of energy. Energy current, in 
less fancy words, is heat flow. Furthermore, the current is proportional to 
the temperature difference — just as electric current is proportional to volt- 
age difference. In both systems, flow is proportional to effort. Therefore, 
heat flow can be understood by using circuit analogies. 
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As an example, I often prepare a cup of tea but forget to 
drink it while it is hot. Like a discharging capacitor, the 
tea slowly cools toward room temperature and becomes 
undrinkable. Heat flows out through the mug. Its walls 
provide a thermal resistance; by analogy to an RC cir- 
cuit, let's denote the thermal resistance R t . The heat is 
stored in the water and mug, which form a heat reservoir. This reservoir, 
of heat rather than of charge, provides the thermal capacitance, which we 
denote C t . (Thus, the mug participates in the thermal resistance and capac- 
itance.) Resistance and capacitance are transferable ideas. 

The product R t C t is, by analogy to the RC circuit, the thermal time con- 
stant r. To estimate r with a home experiment (the method we used in 
Section 1.7), heat up a mug of tea; as it cools, sketch the temperature gap 
between the tea and room temperature. In my extensive experience of tea 
neglect, an enjoyably hot cup of tea becomes lukewarm in half an hour. To 
quantify these temperatures, enjoyably warm may be 130 °F (« 55 °C), room 
temperature is 70 °F (« 20 °C), and lukewarm may be 85 °F (« 30 °C). 

130 °F (hot tea) 



t = 0 

85 °F (lukewarm) 



70 °F (room temperature) 

Based on the preceding data, what is the approximate thermal time constant of the 
mug of tea? 

In one thermal time constant, the temperature gap falls by a factor of e (just 
as the voltage gap falls by a factor of e in one electrical time constant). For 
my mug of tea, the temperature gap between the tea and the room started 
at 60 °F: 

enjoyably warm — room temperature = 60 °F. (2.57) 

130°F 70"°F 

In the half hour while the tea cooled in the microwave, the temperature gap 
fell to 15 °F: 
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lukewarm — room temperature = 15 °F. (2.58) 

8?°F 70”°F 

Therefore, the temperature gap decreased by a factor of 4 in half an hour. 
Falling by the canonical factor of e (roughly 2.72) would require less time: 
perhaps 0.3 hours (roughly 20 minutes) instead of 0.5 hours. A more pre- 
cise calculation would be to divide 0.5 hours by In 4, which gives 0.36 hours. 
However, there is little point doing this part of the calculation so precisely 
when the input data are far less precise. Therefore, let's estimate the ther- 
mal time constant r as roughly 0.3 hours. 

Using this estimate, we can understand what happens to the tea mug when, 
as it often does, it spends a lonely few days in the microwave, subject to the 
daily variations in room temperature. This analysis will become our model 
for the daily temperature variations in a house. 

How does a teacup with r « 0.3 hours respond to daily temperature variations? 



C t 



First, set up the circuit analogy. The output signal is T VW/T- 

still the tea's temperature. The input signal is the (si- r °° m R 

nusoidally) varying room temperature. However, 
the ground signal, which is our reference tempera- 
ture, cannot also be the room temperature. Instead, 
we need a constant reference temperature. The sim- 
plest choice is the average room temperature T avg . (After we have trans- 
ferred this analysis to the temperature variation in houses, we'll see that 
the conclusion is the same even with a different reference temperature.) 

The gain connects the amplitudes of the output and input signals: 
amplitude of the output signal 1 



mug and 
tea 



gam 



amplitude of the input signal 1 + jcoT 



(2.59) 



1 av g 



The input signal (room temperature) varies with a frequency / of 1 cycle 
per day. Then the dimensionless parameter cot in the gain is roughly 0.1. 
Here is that calculation: 



2tt x 1 



cycle 



day 



0.3 hr 



1 day 
24 hr 



0.1. 



(2.60) 



The system is driven by a low-frequency signal: co is not large enough to 
make cot comparable to 1. As the gain expression reminds us, the mug of 
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tea is a low-pass filter for temperature variations. It transmits this low-fre- 
quency input temperature signal almost unchanged to the output — to the 
tea temperature. Therefore, the inside (tea) temperature almost exactly fol- 
lows the outside (room) temperature. 



The opposite extreme is a house. Compared to walls 

rr r ^outside MW — 

the mug, a house has a much higher mass and 

therefore thermal capacitance. The resulting time 

constant r = R t C t is probably much longer for a 
house than for the mug. As an example, when 
I taught in sunny Cape Town, where houses are 

often unheated even in winter, the mildly insulated house where I stayed 
had a thermal time constant of approximately 0.5 days. 



hnside 



house 



1 av g 



For this house the dimensionless parameter cot is much larger than it was 
for the tea mug. Here is the corresponding calculation. 



/ 



cycle 

2n x 1 — — x 0.5 days « 3. 
day 



<j} T 



(2.61) 



What consequence does cot 



3 have for the indoor temperature? 



In the Cape Town winter, the outside temperature varied daily between 
45 °F and 75 °F; let's also assume that it varied approximately sinusoidally. 
This 30 °F peak-to-peak variation, after passing through the house low-pass 
filter, shrinks by a factor of approximately 3. Here is how to find that factor 
by estimating the magnitude of the gain. 



|gain| = 



amplitude of T inside 




1 


amplitude of T outside 




1 + ]COT 



(2.62) 



(It is slightly confusing that the outside temperature is the input signal, and 
the inside temperature is the output signal!) Now plug in cot w 3 to get 



|gain| 



1 

1 + 3; 



1 _ 1 
Vl 2 + 3 2 ~ 3 



(2.63) 



In general, when cot » 1, the magnitude of the gain is approximately 1/cot. 

Therefore, the outside peak-to-peak variation of 30 °F becomes a smaller 
inside peak-to-peak variation of 10 °F. Here is a block diagram showing 
this effect of the house low-pass filter. 
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(2.64) 



Our comfort depends not only on the temperature variation (I like a fairly 
steady temperature), but also on the average temperature. 



What is the average temperature indoors? 



It turns out that the average temperature indoors is equal to the average 
temperature outdoors! To see why, let's think carefully about the reference 
temperature (our thermal analog of ground). Before, in the analysis of the 
forgotten tea mug, our reference temperature was the average indoor tem- 
perature. Because we are now trying to determine this value, let's instead 
use a known convenient reference temperature — for example, the cool 10 °C, 
which makes for round numbers in Celsius or Fahrenheit (50 °F). 

The input signal (the outside temperature) varied in winter between 45 °F 
and 75 °F. Therefore, it has two pieces: (1) our usual varying signal with 
the 30 °F peak-to-peak variation, and (2) a steady signal of 10 °F. 

75 °F 



60 °F 
45 °F 

full signal varying piece steady piece 




The steady signal is the difference between the average outside temperature 
of 60 °F and the reference signal of 50 °F. 

Let's handle each piece in turn — we are using divide-and-conquer reason- 
ing again. We just analyzed the varying piece: It passes through the house 
low-pass filter and, with lot « 3, it shrinks significantly in amplitude. In 
contrast, the nonvarying part, which is the average outside temperature, 
has zero frequency by definition. Therefore, its dimensionless parameter 
cor is exactly 0. This signal passes through the house low-pass filter with a 
gain of 1. Asa result, the average output signal (the inside temperature) is 
also 60 °F: the same steady 10 °F signal measured relative to the reference 
temperature of 50 °F. 
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The 10 °F peak-to-peak inside-temperature amplitude is a variation around 
60 °F. Therefore, the inside temperature varies between 55 °F and 65 °F 
(13 °C to 18 °C). Indoors, when I am not often running or otherwise gener- 
ating much heat, I feel comfortable at 68 °F (20 °C). So, as this circuit model 
of heat flow predicts, I wore a sweater day and night in the Cape Town 
house. (For more on using RC circuit analogies for building design, see the 
"Design masterclass" article by Doug King [30].) 

Problem 2.20 When is the house coldest? 

Based on the general form for the gain, 1/ (1 +ju’T), when in the day will the Cape 

Town house be the coldest, assuming that the outside is coldest at midnight? 



2.5 Summary and further problems 

Geometric means, impedances, low-pass filters — these ideas are all abstrac- 
tions. An abstraction connects seemingly random details into a higher-level 
structure that allows us to transfer knowledge and insights. By building ab- 
stractions, we amplify our intelligence. 

Indeed, each of our reasoning tools is an abstraction or reusable idea. In 
Chapter 1, for example, we learned how to split hard problems into tractable 
ones, and we named this process divide-and-conquer reasoning. Don't 
stop with this one process. Whenever you reuse an idea, identify the trans- 
ferable process, and name it: Make an abstraction. With a name, you will 
recognize and reuse it. 

Problem 2.21 From circles to spheres 

In this problem, you first find the area of a circle from its circumference and then 
use analogous reasoning to find the volume of a sphere. 

a. Divide a circle of radius r into pie wedges. Then snip and unroll the circle: 




Use the unrolled picture and the knowledge that the circle's circumference is 
Inr to show that its area is nr 2 . 

b. Now extend the argument to a sphere of radius r: Find its volume given that 
its surface area is 477T 2 . (This method was invented by the ancient Greeks.) 
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Problem 2.22 Gain of an LRC circuit 



Use the impedance of an inductor (Problem 2.19) to 
find the gain of the classic LRC circuit. In this con- 
figuration, in which the output voltage measured 
across the resistor, is the circuit a low-pass filter, a 
high-pass filter, or a band-pass filter? 




Problem 2.23 Continued fraction 

Evaluate the continued fraction 



(2.67) 



Compare this problem with Problem 2.8. 



Problem 2.24 Exponent tower 

Evaluate 

42 . (2.68) 

Here, means 



Problem 2.25 Coaxial cable termination 

In physics and electronics laboratories around the world, the favorite way to con- 
nect equipment and transmit signals is with coaxial cable. The standard coaxial 
cable, RG-58/U, has a capacitance per length of 100 picofarads per meter and an 
inductance per length of 0.29 microhenries per meter. It can be modeled as a long 
inductor-capacitor ladder: 




What resistance R placed at the end (in parallel with the last capacitor) makes the 
cable look like an infinitely long LC cable? 



Problem 2.26 UNIX and Linux 

Using Mike Gancarz's The UNIX Philosophy [17] and Linux and the Unix Philoso- 
phy [18], find examples of abstraction in the design and philosophy of the UNIX 
and Linux operating systems. 





Part II 



Discarding complexity 
without losing information 



You've divided your hard problems into manageable pieces. You've found 
transferable, reusable ideas. When these tools are not enough and prob- 
lems are still too complex, you need to discard complexity — the theme of 
our next three tools. They help us discard complexity without losing in- 
formation. If a system contains a symmetry (Chapter 3) — or what is closely 
related, it is subject to a conservation law — using the symmetry greatly sim- 
plifies the analysis. Alternatively, we often do not care about a part of an 
analysis, because it is the same for all the objects in the analysis. To ig- 
nore those parts, we use proportional reasoning (Chapter 4). Finally, we 
can ensure that our equations do not add apples to oranges. This simple 
idea — dimensional analysis (Chapter 5) — greatly shrinks the space of pos- 
sible solutions and helps us master complexity. 



to master complexity 



organize it 

Parti 



discard it 
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The rain is pouring down and shelter is a few hundred yards away. Do you 
get less wet by running? On the one hand, running means less time for 
raindrops to hit you. On the other hand, running means that the raindrops 
come toward you more directly and therefore more rapidly. The resolution 
is not obvious — until you apply the new tool of this chapter: symmetry and 
conservation. (In Section 3.1.1, we'll resolve this run-in-the-rain question.) 



3.1 Invariants 

We use symmetry and conservation whenever we find a quantity that, de- 
spite the surrounding complexity, does not change. This conserved quan- 
tity is called an invariant. Finding invariants simplifies many problems. 

Our first invariant appeared unannounced in Section 1 .2 when we estimated 
the carrying capacity of a lane of highway. The carrying capacity — the rate 
at which cars flow down the lane — depends on the separation between the 
cars and on their speed. We could have tried to estimate each quantity and 
then the carrying capacity. However, the separation between cars and the 
cars' speeds vary greatly, so these estimates are hard to make reliably. 
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Instead, we invoked the 2-second following rule. As long as drivers obey it, 
the separation between cars equals 2 seconds of driving. Therefore, one car 
flows by every 2 seconds — which is the lane's carrying capacity (in cars per 
time). By finding an invariant, we simplified a complex, changing process. 
When there is change, look for what does not change! (This wisdom is from 
Arthur Engel's Problem-Solving Strategies [12].) 



3.1.1 To run or walk in the rain? 

We'll practice with this tool by deciding whether to run or walk in the rain. 
It's pouring, your umbrella is sitting at home, and home lies a few hundred 
meters away. 

To minimize how wet you become, should you run or walk? 



raindrops 



Let's answer this question with three simplifica- you 
tions. First, assume that there is no wind, so the 
rain is falling vertically. Second, assume that the 
rain is steady. Third, assume that you are a thin 
sheet: You have zero thickness along the direc- 
tion toward your house (this approximation was 
more valid in my youth). Equivalently, your head is protected by a water- 
proof cap, so you do not care whether raindrops hit your head. You try to 
minimize just the amount of water hitting your front. 



Your only degree of freedom — the only parameter that you get to choose — is 
your speed. A high speed leaves you in the rain for less time. However, it 
also makes the rain come at you more directly (more horizontally). But 
what remains constant, independent of your speed, is the volume of air 
that you sweep out. Because the rain is steady, that volume contains a fixed 
number of raindrops, independent of your speed. Only these raindrops hit 
your front. Therefore, you get equally wet, no matter your speed. 



This surprising conclusion is another application of the principle that when 
there is change, look for what does not change. Here, we could change our 
speed by choosing to walk or run. Yet no matter what our speed, we sweep 
out the same volume of air — our invariant. 



home 



Because the conclusion of this invariance analysis, that it makes no differ- 
ence whether you walk or run, is surprising, you might still harbor a nag- 
ging doubt. Surely running in the rain, which we do almost as a reflex, 
provide some advantage over a leisurely stroll. 
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Is it irrational to run to avoid getting wet? 

If you are infinitely thin, and are just a rectangle moving in the rain, then 
the preceding analysis applies: Whether you run or walk, your front will 
absorb the same number of raindrops. But most of us have a thickness, and 
the number of drops landing on our head depends on our speed. If your 
head is exposed and you care how many drops land on your head, then you 
should run. But if your head is covered, feel free to save your energy and 
enjoy the stroll. Running won't keep you any dryer. 

3.1.2 Tiling a mouse-eaten chessboard 

Often a good way to practice a new tool is on a mathematical problem. Then 
we do not add the complexity of the physical world to the problem of learn- 
ing a new tool. Here, therefore, is a mathematical problem: a solitaire game. 

A mouse comes and eats two diagonally opposite corners out 
of a standard, 8x8 chessboard. We have a box of rectangular, 

2x1 dominoes. 

Can these dominoes tile the mouse-eaten chessboard? In other words, 
can we lay down the dominoes to cover every square exactly once 
(with no empty squares and no overlapping dominoes)? 

Placing a domino on the board is one move in this solitaire 
game. For each move, you choose where to place the domino — so you have 
many choices at each move. The number of possible move sequences grows 
rapidly. Instead of examining all these sequences, we'll look for an invari- 
ant: a quantity unchanged by any move of the game. 

Because each domino covers one white square and one black square, the 
following quantity I is invariant (remains fixed): 

I = uncovered black squares — uncovered white squares. ( 3 . 1 ) 

On a regular chess board, with 32 white squares and 32 black squares, the 
initial position has 1 = 0. The nibbled board has two fewer black squares, 
so I starts at 30 — 32 = —2. In the winning position, all squares are covered, 
so I = 0. Because I is invariant, we cannot win: The dominoes cannot tile 
the nibbled board. 

Each move in this game changes the chessboard. By finding what does not 
change, an invariant, we simplified the analysis. 
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Invariants are powerful partly because they are abstractions. The details of 
the empty squares — their exact locations — lie below the abstraction barrier. 
Above the barrier, we see only the excess of black over white squares. The 
abstraction contains all the information we need in order to know that we 
can never tile the chessboard. 



Problem 3.1 Cube solitaire 

A cube has numbers at each vertex; all vertices start at 0 except 
for the lower left corner, which starts at 1. The moves are all of 
the same form: Pick any edge and increment its two vertices 
by one. The goal of this solitaire game is to make all vertices 
multiples of 3. 

For example, picking the bottom edge of the front face and 
then the bottom edge of the back face, makes the following 
sequence of cube configurations: 
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(3.2) 



Although no configuration above wins the game, can you win with a different 
move sequence? If you can win, give a winning sequence. If you cannot win, prove 
that you cannot. 

Hint: Create analogous but simpler versions of this game. 

Problem 3.2 Triplet solitaire 

Here is another solitaire game. Start with the triplet (3, 4, 5) . At each move, choose 
any two of the three numbers. Call the choices a and b. Then make the following 
replacements: 

a — > 0.8fl — 0.6 b; 
b — * 0.6a + 0.8 b. 



(3.3) 



Can you reach (4, 4, 4)? If you can, give a move sequence; otherwise, prove that it 
is impossible. 

Problem 3.3 Triplet-solitaire moves as rotations in space 

At each step in triplet solitaire (Problem 3.2), there are three possible moves, de- 
pending on which pair of numbers from among a, b, and c you choose to replace. 
Describe each of the three moves as a rotation in space. That is, for each move, give 
the rotation axis and the angle of rotation. 
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Problem 3.4 Conical pendulum 

Finding the period of a pendulum, even at small amplitudes, 
requires calculus because of the pendulum's varying speed. 

When there is change, look for what does not change. Accord- 
ingly, Christiaan Huygens (1629-1695), called "the most inge- 
nious watchmaker of all time" [20, p. 79] by the great physi- 
cist Arnold Sommerfeld, analyzed the motion of a pendulum 
moving in a horizontal circle (a conical pendulum). Project- 
ing its two-dimensional motion onto a vertical screen produces 
one-dimensional pendulum motion; thus, the period of the 
two-dimensional motion is the same as the period of one-dimensional pendulum 
motion! Use that idea to find the period of a pendulum (without calculus!). 

3.1.3 Logarithmic scales 

In the solitaire game in Section 3.1.2, a move covered low end of hearing 20 Hz 
two chessboard squares with a domino. In the game piano middle C 262 Hz 

of understanding the world, a frequent move is chang- highest piano C 4186 Hz 

ing the system of units. As in solitaire, ask, "When high end of hearing 20 kHz 
such a move is made, what is invariant?" As an exam- 
ple to crystallize our thinking, here are frequencies related to human hear- 
ing. Let's graph them using kilohertz (kHz) as the unit. The frequencies 
then arrange themselves as follows: 

low end high C high end 




middle C 

Now let's change units from kilohertz to hertz (Hz) — and keep the 0, 10, and 
20 labels at their current positions on the page. This change magnifies every 
spacing by a factor of 1000: The new 20 hertz is where 20 kilohertz was 
(about 4 inches or 10 centimeters to the right of the origin), and 20 kilohertz, 
the high end of human hearing, sits 100 meters to our right — far beyond the 
borders of the page. This new scale is not very useful. 

However, we missed the chance to use an invariant: the ratio between fre- 
quencies. For example, the ratio between the upper end of human hearing 
(20 kilohertz) and middle C (262 hertz) is roughly 80. If we use a repre- 
sentation based on ratio rather than absolute difference, then the spacing 
between frequencies would not change even when we changed the unit. 
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We have such a representation: logarithms! On a logarithmic scale, a dis- 
tance corresponds to a ratio rather than a difference. To see the contrast, 
let's place the numbers from 1 to 10 on a logarithmic scale. 



1 2 3 456789 10 

The physical gap between 1 and 2 represents not their difference but rather 
their ratio, namely 2. According to my ruler, the gap is approximately 3.16 
centimeters. Similarly, the physical gap between 2 and 3 — approximately 
1.85 centimeters — represents the smaller ratio 1.5. In contrast to their rela- 
tive positions on a linear scale, 2 and 3 on a logarithmic scale are closer than 
1 and 2 are. On a logarithmic scale, 1 and 2 have the same separation as 2 
and 4: Both gaps represent a ratio of 2 and therefore have the same physical 
size (3.16 centimeters). 

Problem 3.5 Practice with ratio thinking 

On a logarithmic scale, how does the physical gap between 2 and 8 compare to the 
gap between 1 and 2? Decide based on your understanding of ratios; then check 
your reasoning by measuring both gaps. 

Problem 3.6 More practice with ratio thinking 

Is the gap between 1 and 10 less than twice, equal to twice, or more than twice the 
gap between 1 and 3? Decide based on your understanding of ratios; then check 
your reasoning by measuring both gaps. 

Problem 3.7 Moving along a logarithmic scale 

On the logarithmic scale in the text, the gap between 2 and 3 is approximately 1.85 
centimeters. Where do you land if you start at 6 and move 1.85 centimeters right- 
ward? Decide based on your understanding of ratios; then check your reasoning 
by using a ruler to find the new location. 

Problem 3.8 Extending the scale to the right 

On the logarithmic scale in the text, the gap between 1 and 10 is approximately 
10.5 centimeters. If the scale were extended to include numbers up to 1000, how 
large would the gap between 10 and 1000 be? 

Problem 3.9 Extending the scale to the left 

If the logarithmic scale were extended to include numbers down to 0.01, how far 
to the left of 1 would you have to place 0.04? 



On a logarithmic scale, the frequencies related to hearing arrange them- 
selves as follows: 
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low end middle C 



high C 



high end 



I I high ' 

i i i i I i I 



id- 
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10 ° 



id 1 



io 2 



id 3 



id 4 



id 5 



Hz 



Changing the units to kilohertz just shifts all the frequencies, but leaves 
their relative positions invariant: 



low end middle C 



high C 



high end 



1,1,1 . i 



10 “ 



10 “ 



10 ° 



Id 1 



Id 2 



Id 3 



_l_ 

Id 4 



- 1 kHz 
Id 5 



Problem 3.10 Acoustic energy fluxes 

In acoustics, sound intensity is measured by energy flux, which is measured in deci- 
bels (dB) — a logarithmic representation of watts per square meter. On the decibel 
scale, 0 decibels corresponds to the reference level of 10 -12 watts per square meter. 
Every 10 decibels (or 1 bel) represents an increase in energy flux of a factor of 10 
(thus, 20 decibels represents a factor-of-100 increase in energy flux). 

a. How many watts per square meter is 60 decibels (the sound level of normal 
conversation)? 

b. Place the following energy fluxes on a decibel scale: 10 -9 watts per square me- 
ter (an empty church), 10 -2 watts per square meter (front row at an orchestra 
concert), and 1 watt per square meter (painfully loud). 

Logarithmic scales offer two benefits. First, as we saw explicitly, logarithmic 
scales incorporate invariance. The second benefit was only implicit in the 
previous discussion: Logarithmic scales, unlike linear scales, allow us to 
represent a huge range. For example, if we include power-line hum (50 or 
60 hertz) on the linear frequency scale on p. 61, we can hardly distinguish 
its position from 0 kilohertz. The logarithmic scale has no problem. 



low end middle C high C 



10- 



ld“ 



10° 



Id 1 



1 , 1,1 

fid 2 id 3 



high end 

I I 



id 4 



id 5 



Hz 



power-line hum 



In our fallen world, benefits usually conflict. One usually has to sacrifice 
one benefit for another (speed for accuracy or justice for mercy). With log- 
arithmic scales, however, we can eat our cake and have it. 
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Problem 3.11 Labeling a logarithmic scale 

Let's make a scale to represent sizes in the universe, from protons (10 -15 meters) 
to galaxies (10 30 meters), with people and bacteria in between. With such a large 
range, we should use a logarithmic scale for size L . Which one of these two ways 
of labeling the scale is correct, and which way is nonsense? 



a. 

b. 



-10 0 



10 30 10 ° 



10 20 



10 10 10 20 



30 



10 30 



lo Sio 1 



L (m) 



Logarithmic scales can make otherwise obscure symbolic calculations intu- 
itive. An example is the geometric mean, which we used in Section 1.6 to 
make gut estimates: 

estimate = flower bound x upper bound . 

Geometric means also occur in the physical world. 

As you found in Problem 2.9, the distance d to the 
horizon, as seen from a height h above the Earth's 
surface, is 

d « 4hD, (3.5) 

where D = 2R Earth is the diameter of the Earth. 

Imagine a lifeguard sitting with his or her eyes at 
a height h = 4 meters above sea level. Then the distance to the horizon is 

d « ( 4 m x 12000 km ) 1/2 . (3.6) 

3 

To do the calculation, we convert 12000 kilometers to 1.2 x 10 7 meters, cal- 
culate 4 x 1.2 x 10 7 , and compute the square root: 

d « \l 4 x 1.2 x 10 7 m « 7000 m = 7 km. (3.7) 



(3.4) 




In this symbolic form with a square root, the calculation obscures the fun- 
damental structure of the geometric mean. We first calculate hD, which is 
an area (and often contains, as it did here, a huge number). Then we take 
the square root to get back a distance. However, the area has nothing to do 
with the structure of the problem. It is merely a bookkeeping device. 

Bookkeeping devices are useful; they are how you tell a computer what to 
calculate. However, to understand the calculation, we, as humans, should 
use a logarithmic scale to represent the distances. This scale captures the 
structure of the problem. 
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left gap 



right gap 



How can we describe the position of the geometric mean fhD ? 

The first clue is that the geometric mean, because it gap endpoints r; 

is a mean, lies somewhere between h and D. This 

property is not obvious from the calculation using left h. ..fhD — 

the square root. To find where the geometric mean 

lies, mind the gaps. On a logarithmic scale, a gap right fhD...D — - 

represents the ratio of its endpoints. As shown 

in the table, the left and right gaps represent the 

same ratio, namely fD/h ! Therefore, on the logarithmic scale, the geomet- 
ric mean lies exactly halfway between h and D. 

Based on this ratio representation, we can rephrase the geometric-mean 
calculation in a form that we can do mentally. 

What distance is as large compared to 4 meters as 12 000 kilometers is compared to 
it? 



For lack of imagination, my first guess is 1 kilometer. It's 12000 times 
smaller than the diameter D (which is 12 000 kilometers), but only 250 times 
larger than the height h (4 meters). My guess of 1 kilometer is therefore 
somewhat too small. 

How is a guess of 10 kilometers? 

It's 2500 times larger than h, but only 1200 times smaller than D. I overshot 
slightly. How about 7 kilometers? It's roughly 1750 times larger than 4 me- 
ters, and roughly 1700 times smaller than 12000 kilometers. Those gaps 
are close to each other, so 7 kilometers is the approximate geometric mean. 

h fhD »; 7 km D 

m 

10 ° 10 2 10 4 10 6 10 8 

Similarly, when we make gut estimates, we should place our lower and up- 
per estimates on a logarithmic scale. Our best gut estimate is then their 
midpoint. What a simple picture! 




s-l o s-i q 
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Should nil quantities be placed on a logarithmic scale? 

No. An illustrative contrast is between size and position. Both quantities 
have the same units. But size ranges from 0 to oo, whereas position ranges 
from — oo to oo. Position therefore cannot be placed on a logarithmic scale 
(where would you put —1 meter?). In contrast, size (a magnitude) belongs 
on a logarithmic scale. In general, location parameters, such as position, 
should not be placed on a logarithmic scale but magnitudes should. 



3.2 From invariant to symmetry operation 

In the preceding examples, we knew the moves of the game and sought 
the invariant. In the mouse-eaten chessboard (Section 3.1.2), the moves are 
putting down a 2 x 1 domino on two adjacent empty squares. The invariant 
was the difference between empty black and white squares. Often, however, 
the benefit of invariants lies in the other direction: You know the invariant 
and seek the moves that preserve it. These moves are called the symmetry 
operations or simply the symmetries. 

We'll first examine this idea in a familiar situation: converting units (Sec- 
tion 3.2.1). Then we'll practice it on a sum solved by the three-year-old Carl 
Friedrich Gauss (Section 3.2.2) and then by finding maxima and minima 
(Section 3.2.3). 



3.2.1 Converting units 



We often convert a quantity from one unit system to another — for example, 
mass from English to metric units or prices from dollars to pounds or euros. 
A useful physical conversion is writing energy density — energy divided by 
amount of stuff — in useful units. Let's start with the reasonable energy 
unit for a chemical bond, namely the electron volt or eV (Section 2.1). Then 
a useful unit for energy density is 



leV 

molecule 



( 3 . 8 ) 



This energy density is our invariant. As we convert from one unit system 
to another, our moves have to preserve the energy density. 



What are the legal moves — the moves that preserve the energy density? 



The legal moves are all ways of multiplying by 1 — for example, by 
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6 x 10 23 molecules 1 mol 

or . (3.9) 

1 mol 6 x 10 23 molecules 

Either quotient is a form of 1, because 1 mole is defined to be Avogadro's 
number of molecules, and Avogadro's number is 6 x 10 23 . I carefully wrote 
"1 mol" with the number rather than simply as "mol." The more explicit 
form reminds us that "6 x 10 2d molecules per mole" is shorthand for a quo- 
tient of two identical quantities: 6 x 10 2d molecules and 1 mole. 

Multiplying the energy density by the first form of 1 gives 

1 eV 6 x 10 23 jnoleeutes 6 x 10 23 eV 

X = . (3.10) 

molecule 1 mol mol 



(If we had multiplied by the second form of 1, the units of molecules would 
have become molecules squared instead of canceling. The strike-through 
lines help us check that we got the desired units.) The giant exponent makes 
this form almost meaningless. To improve it, let's multiply by another form 
of 1, based on the definition of an electron volt. Two forms of 1 are 



1.6 x 10“ 19 J 
leV 



or 



leV 

1.6 x 10 — 19 J 



(3.11) 



Multiplying by the first form of 1 gives 

1 jaV" 6 x 10 23 molecules' \ 6 x 10 -19 T 10 2 kf 

X X « (3.12) 

molecule" 1 mol 1 eV" mol 



(A more exact value is 96 kilojoules per mole.) In the United States, energies 
related to food are stated in Calories, also known as kilocalories (roughly 
4.2 kilojoules). In calorie units, the useful energy-density unit is 

96 .kf 1 kcal 23 kcal 

X « . (3.13) 

lmol 4.24^ mol 



► 



Which form is more meaningful: 23 or 



23 kcal ? 
mol 



The forms are mathematically equivalent: You can multiply by 23 before or 
after dividing by a mole. However, they are not psychologically equivalent. 
The first form builds the abstraction of kilocalories per mole, and then says, 
"Here are 23 of them." In contrast, the second form gives us the energy for 
1 mole, a human-sized amount. The second form is more meaningful. 

Similarly, the speed of light c is commonly quoted as (approximately) 
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3 x 10 8 — . 
s 

The psychologically fruitful alternative is 

3 x 10 8 m 
c = — ; • 



(3.14) 



(3.15) 



This form suggests that 300 million meters, at least for light, is the same as 
1 second. With this idea, you can convert wavelength to frequency (Prob- 
lem 3.14); with a slight extension, you can convert frequency to energy 
(Problem 3.15) and energy to temperature (Problem 3.16). 

Problem 3.12 Absurd units 

By multiplying by suitable forms of 1, convert 1 furlong per fortnight into meters 
per second. 

Problem 3.13 Rainfall units 

Rainfall, in nonmetric parts of the world, is sometimes measured in acre feet. By 
multiplying by suitable forms of 1, convert 1 acre foot to cubic meters. (One square 
mile is 640 acres.) 

Problem 3.14 Converting wavelength to frequency 

Convert green-light wavelength, 0.5 micrometers (0.5 pm), to a frequency in cycles 
per second (hertz or Hz). 

Problem 3.15 Converting frequency to energy 

Analogously to how you used the speed of light in Problem 3.14, use Planck's 
constant h to convert the frequency of green light to an energy in joules (J) and in 
electron volts (eV). This energy is the energy of a green-light photon. 

Problem 3.16 Converting energy to temperature 

Use Boltzmann's constant to convert the energy of a green-light photon (Prob- 
lem 3.15) to a temperature (in kelvin). This temperature, except for a factor of 3, is 
the surface temperature of the Sun! 



Conversion factors need not be numerical. Insight often comes from sym- 
bolic factors. Here is an example from fluid flow. As we will derive in 
Section 5.3.2, the drag coefficient c d is defined as the dimensionless ratio 



c d = 



F 



drag 




(3.16) 



where p is the fluid density, v is the speed of the object moving in the fluid, 
and A is the object's cross-sectional area. To give this definition and ratio a 





3.2 From invariant to symmetry operation 



69 



physical interpretation, multiply it by d/d, where d is the distance that the 
object travels: 



^drag d 

\l pv 2 Ad 



( 3 . 17 ) 



The numerator, F drag d, is the work done or the energy consumed by drag. 
In the denominator, the product Ad is the volume of fluid displaced by the 
object, so pAd is the corresponding mass of fluid. Therefore, the denomina- 
tor is also 



1 

2 



x mass of fluid displaced x v 2 . 



( 3 . 18 ) 



The object's speed v is also approximately the speed given to the displaced 
fluid (which the object shoved it out of its way). Therefore, the denominator 
is roughly 

- x mass of fluid displaced x (speed of displaced fluid) 2 . ( 3 . 19 ) 



This expression is the kinetic energy given to the displaced fluid. The drag 
coefficient is therefore roughly the ratio 

energy consumed by drag 
Cd energy given to the fluid 

My tenth-grade chemistry teacher, Mr. McCready, told us that unit conver- 
sion was the one idea that we should remember from the entire course. 
Almost every problem in the chemistry textbook could be solved by unit 
conversion, which says something about the quality of the book but also 
about the power of the method. 



3.2.2 Gauss's childhood sum 

A classic example of going from the invariant to the symmetry is the follow- 
ing story of the young Carl Friedrich Gauss. Although maybe just a legend, 
the story is so instructive that it ought to be true. Once upon a time, when 
Gauss was 3 years old, his schoolteacher, wanting to occupy the students, 
assigned them to compute the sum 

S = l+ 2 + 3 + -- + 100, ( 3 . 21 ) 

and sat back to enjoy the break. In a few minutes. Gauss returned with an 
answer of 5050. 
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Was Gauss right? If so, how did he compute the sum so quickly? 

Gauss saw that the sum — the invariant — is unchanged when the terms are 
added backward, from highest to lowest: 

1 + 2 + 3 4 b 100 = 100 + 99 + 98 H b 1. (3.22) 

Then he added the two versions of the sum, the original and the reflected: 

1+ 2 + 3 + - + 100= S 

+ 100 + 99 + 98 + ••■ + 1 = S (3 23) 

101 + 101 + 101 + - + 101 = 2 S. 

In this form, 2S is easy to compute: It contains 100 copies of 101. Therefore, 
2S = 100 x 101, and S = 50 x 101 or 5050 — as the young Gauss claimed. He 
made the problem so simple by finding a symmetry: a transformation that 
preserved the invariant. 

Problem 3.17 Number sum 

Use Gauss's method to find the sum of the integers between 200 and 300 (inclu- 
sive). 

Problem 3.18 Symmetry for algebra 

Use symmetry to find the missing coefficients in the expansion of ( a — b ) 3 : 

( a — b) 3 = a 3 — 3 a 2 b+? ab 2 + ? b 3 . (3.24) 

Problem 3.19 Integrals 

Evaluate these definite integrals. Hint: Use symmetry. 

10 OO 2 oo , 

(a) f x 3 e~ x ~ dx, (b) f * - dx, and (c) f nX dx. 

_ J 10 -L 1 + 7*2 + 18*8 W J 1 + H 



3.2.3 Finding maxima or minima 

To practice finding the symmetry operation, we'll find the maximum of the 
function 6x — x 2 without using calculus. Calculus is the elephant gun. It 
can solve many problems, but only after blasting them into the same form 
(smithereens). Avoiding calculus forces us to use more particular, but more 

subtle methods — such as symmetry As Gauss did in summing 1+2-1 b 

100, let's find a symmetry operation that preserves the essential feature of 
the problem — namely, the location of the maximum. 
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Symmetry implies moving around an object's pieces. Fortunately, our func- 
tion 6x — x 2 factors into pieces: 

6x — x 2 — x(6 — x) . (3.25) 

This form, along with the idea that multiplication is commutative, suggests 
the symmetry operation. For if the operation just swaps the two factors, 
replacing x(6 — x) with (6 — x)x, it does not change the location of the 
maximum. (A parabola has exactly one maximum or minimum.) 

The symmetry operation that makes the swap is 

X <— » 6 - X. (3.26) 

It turns 2 into 4 (and vice versa) and 1 into 5 (and vice versa). The only value 
unchanged (left invariant) by the symmetry operation is x = 3. Therefore, 
6x — x 2 has its maximum at x = 3. 

Geometrically, the symmetry operation reflects the graph of 
6x — x 2 through the line x = 3. By construction, this symme- 
try operation preserves the location of the maximum. There- 
fore, the maximum has to lie on the line x = 3. 

We could have found this maximum in several other ways, so 
the use of symmetry might seem superfluous or like overkill. 

However, it warms us up for the following, more compli- 
cated use. The energy required to fly has two pieces: gener- 
ating lift, which requires an energy A tv 2 , and fighting drag, which requires 
an energy Bv 2 . (A and B are constants that we estimate in Sections 3.6.2 and 
4.6.1.) 

Eflight = 4, + Bv 2 . (3.27) 

To minimize fuel consumption, planes choose their cruising speed to min- 
imize Eflighe More precisely, a cruising speed is selected, and the plane is 
designed so that this speed minimizes £ fljght . 

In terms of the constants A and B, what speed minimizes E^ jght ? 

Like the parabola x(6 — x), this energy has one extremum. For the parabola, 
the extremum was a maximum; here, it is a minimum. Also similar to the 
parabola, this energy has two pieces connected by a commutative opera- 
tion. For the parabola, the operation was multiplication; here, it is addition. 
Continuing the analogy, if we find a symmetry operation that transposes 
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the two pieces, then the speed preserved by the operation will be the mini- 
mum-energy speed. 

Finding this symmetry operation is hard to do in one gulp, because it must 
transpose 1/v 2 and v 2 and transpose A and B . These two difficulties suggest 
that we apply divide-and-conquer reasoning: Find a symmetry operation 
that transposes 1 / v 2 and v 2 , and then modify so that it also transposes A 
and B. 



To transpose 1 fv 2 and v 2 , the symmetry operation is 
the following: 



v 



1 

v' 



(3.28) 



Now let's restore one of the two constants and modify 
the symmetry operation so that it transposes A/v 2 and 



4a 

v 



(3.29) 




Now let's restore the second constant, B, and find the 
full symmetry operation that transposes A/v 2 and Bv 2 : 




v 



Rewriting it as a replacement for v, the symmetry operation becomes 



JAJB 

v 



(3.31) 



This symmetry operation transposes the drag energy and lift energy, leav- 
ing the total energy E flight unchanged. Solving for the speed preserved by 
the symmetry operation gives us the minimum-energy speed: 






1/4 



(3.32) 



In Section 4.6.1, once we find A and B in terms of the characteristics of the 
air (its density) and the plane (such as its wingspan), we can estimate the 
minimum-energy (cruising) speeds of planes and birds. 

Problem 3.20 Solving a quadratic equation using symmetry 

The equation 6x — x 2 + 7 = 0 has a solution at x = —1. Without using the quadratic 
formula, find any other solutions. 
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For a physical application of symmetry, imagine a uniform metal 
sheet, perhaps aluminum foil, cut into the shape of a regular pen- 
tagon. Imagine that to the edges are attached heat sources and 
sinks — big blocks of metal at a fixed temperature — in order to 
hold the edges at the temperatures marked on the figure. After 
we wait long enough, the temperature distribution in the pentagon stops 
changing (comes to equilibrium). 




Once the pentagon temperature equilibrates, what is the temperature at its center? 



A brute-force, analytic solution is difficult. Heat flow is described by the 
heat equation, a linear second-order partial-differential equation: 

, BP 

KV 2 T = , (3.33) 

where T is the temperature as a function of position and time, and k (kappa) 
is the thermal diffusivity (which we will study in more detail in Chapter 7). 
But don't worry: You do not have to understand the equation, only that it 
is difficult to solve! 

Once the temperature settles down, the time derivative becomes 
zero, and the equation simplifies to kV 2 T = 0. However, even this 
simpler equation has solutions only for simple shapes, and the 
solutions are complicated. For example, the temperature distrib- 
ution on the simpler square sheet is hardly intuitive (the figure 
shows contour lines spaced every 10°). For a pentagon, the temperature 
distribution is worse. However, because the pentagon is regular, symmetry 
might make the solution flow. 

What is a useful symmetry operation? 




Nature, in the person of the heat equation, does not care about the direc- 
tion of our coordinate system. Thus, rotating the pentagon about its center 
does not change the temperature at the center. Therefore, the following five 
orientations of the pentagon share the same central temperature: 




10 ° 



10 ° 



80° 



10 ° 



10 ° 
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Like Gauss adding the two versions of his sum (Section 3.2.2), stack these 
sheets mentally and add the temperatures that lie on top of each other to 
make the temperature profile of a new super sheet (adding the tempera- 
tures is valid because the heat equation is linear). 




(3.34) 



Each super edge contains one 80° edge and four 10° edges, for a temperature 
of 120°. The super sheet is a regular pentagon where all edges are at 120°. 
Therefore, the temperature throughout the sheet is 120° — including at the 
center. Because the symmetry operation has helped us construct a much 
easier problem, we did not have to solve the heat equation. 

One more step tells us the temperature in the center of the original sheet. 
The symmetry operation rotates the pentagon about its center; when the 
plates are stacked, the centers align. Each center then contributes one-fifth 
of the 120° in the center, so the original central temperature is 24°. 

To highlight the transferable ideas (abstractions), compare the symmetry 
solutions to Gauss's sum and to this temperature problem. First, both prob- 
lems seem complicated. Gauss's sum has many terms, all different; the 
pentagon problem seems to require solving a difficult differential equation. 
Second, both problems contain a symmetry operation. In Gauss's sum, 
the symmetry operation reversed the order of the terms; for the pentagon, 
the symmetry operation rotates it by 72°. Finally, the symmetry operation 
leaves an important quantity unchanged. For Gauss's problem, this quan- 
tity is the sum; for the pentagon, it is the central temperature. 

When there is change, look for what does not change. Look for invariants and the 
corresponding symmetries: the operations that preserve the invariants. 

Problem 3.21 Symmetry solution for a square sheet 

Here is the contour plot again of the temperature on a square 
sheet. The contour lines are separated by 10°. Use that infor- 
mation to label the temperature of each contour line. Based 
on the symmetry reasoning, what should the temperature at 
the center of the square be? Is this predicted temperature con- 
sistent with what is shown in the contour plot? 
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Problem 3.22 Simulating the heat equation 

Using symmetry, we showed that the temperature at the cen- 
ter of the pentagon is the average of the temperatures of the 
sides. Check the solution by simulating the heat equation 
with a pentagonal boundary. 

10 ° 

Problem 3.23 Shortest bisecting path 

What is the shortest path that bisects an equilateral triangle into two equal areas? 
Here are three examples of bisecting paths: 





To set your problem-solving gears in motion, first rank these three bisecting paths 
according to their lengths. 



3.4 Box models and conservation 

Invariance underlies a powerful everyday abstraction: box models. We al- 
ready made a box model in Section 3.1.1, to decide whether to run or walk 
in the rain. Now let's examine this method further. The simplest kind of 
box contains a fixed amount of stuff — perhaps the volume of fluid or the 
number of students at an ideal university (where every student graduates 
in a fixed time). Then what goes into the box must come out. This conclu- 
sion seems simple, even simplistic, but it has wide application. 

3.4.1 Supply and demand 

For another example of a box model, return to our 
estimate of US oil usage (Section 1.4). The flow 
into the box — the push or the supply — is the im- 
ported and domestically produced oil. The flow 
out of the box — the pull or the demand — is the oil usage. The estimate, 
literally taken, asks for the supply (how much oil is imported and domesti- 
cally produced). This estimate is difficult. Fortunately, as long as oil does 
not accumulate in the box (for example, as long as oil is not salted away 
in underground storage bunkers), then the amount of oil in the box is an 
invariant, so the supply equals the demand. To estimate the supply, we ac- 
cordingly estimated the demand. This conservation reasoning is the basis 
of the following estimate of a market size. 



oil 



demand 
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How ninny taxis are there in Boston, Massachusetts? 



For many car-free years, I lived in an old neighborhood of Boston. I often 
rode in taxis and wondered about the size of the taxi market — in particular, 
how many taxis there were. This number seemed hard to estimate, because 
taxis are scattered throughout the city and hard to count. 



The box contains the available taxi driving (mea- 
sured, for example, as time). It is supplied by 
taxi drivers. The demand is due to taxi users. As 
long as the supply and demand match, we can 
estimate the supply by estimating the demand. 



supply 


available 


demand 


(drivers) 


taxi driving 


(users) 



For estimating the demand, the starting point is that Boston has roughly 
500 000 residents. As a gut estimate, each resident uses maybe one taxi per 
month, for a 15-minute ride: Boston taxis are expensive; unless one doesn't 
own a car, it's hard to imagine using them more often than once a month 
or for longer than 15 minutes. Then the demand is about 10 5 hours of taxi 
driving per month: 

15 min 1 hr 10 5 hr 

5 x 10 s .residents x x « . (3.35) 

resident' month 60 min month 



How many taxi drivers will that many monthly hours support? 

Taxi drivers work long shifts, maybe 60 hours per week. I'd guess that they 
carry passengers one-half of that time: 30 hours per week or roughly 100 
hours per month. At that pace, 10 5 hours of monthly demand could be 
supplied by 1000 taxi drivers or, assuming each taxi is driven by one driver, 
by 1000 taxis. 

What about tourists? 

Tourists are very short-term Boston residents, mostly without cars. Tourists, 
although fewer than residents, use taxis more often and for longer than 
residents do. To include the tourist contribution to taxi demand. I'll simply 
double the previous estimate to get 2000 taxis. 

This estimate can be checked reliably, because Boston is one of the United 
States cities where taxis may pick up passengers only with a special permit, 
the medallion. The number of medallions is strictly controlled, so medal- 
lions cost a fortune. For about 60 years, their number was restricted to 1525, 
until a 10-year court battle got the limit raised by 260, to about 1800. 
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The estimate of 2000 may seem more accurate than it deserves. However, 
chance favors the prepared mind. We prepared by using good tools: a box 
model and divide-and-conquer reasoning. In making your own estimates, 
have confidence in the tools, and expect your estimates to be at least half 
decent. You will thereby find the courage to start: Optimism oils the rails 
of estimation. 

Problem 3.24 Differential equation for an RC circuit 

Explain how a box model leads to the differential 

equation for the low-pass RC circuit of Section 2.4.4: v in 

RC d -^f + Tout = v in . ( 3 . 36 ) 

(Almost every differential equation arises from a box 

or conservation argument.) 

Problem 3.25 Boston taxicabs tree 

Draw a divide-and-conquer tree for estimating the number of Boston taxicabs. 

First draw it without estimates. Then include your estimates, and propagate the 

values toward the root. 

Problem 3.26 Needles on a Christmas tree 

Estimate the number of needles on a Christmas tree. 




3.4.2 Flux 



Flows, such as the demand for oil or the supply of taxi cabs, 
are rates — an amount per time. Physical flows are also rates, 
but they live in a geometry. This embedding allows us to de- 
fine a related quantity: flux. 

a r , rr rate amount of stuff 

flux of stuff = = . (3.37) 

area area x time 



flow rate 



amount 

time 




For example, particle flux is the rate at which particles (say, 
molecules) pass through a surface perpendicular to the flow, divided by 
the area of the surface. Dividing by the surface area, an operation with no 
counterpart in nonphysical flows (for example, in the demand for taxicabs), 
makes flux more invariant and useful than rate. For if you double the sur- 
face area, you double the rate. This proportionality is not newsworthy, and 
usually doesn't add insight, only clutter. When there is change, look for 
what does not change: Even when the area changes, flux does not. 
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Problem 3.27 Rate versus amount 

Explain why rate (amount per time) is more useful than amount. 

Problem 3.28 What is current density? 

What kind of flux (flux of what?) is current density (current per area)? 

The definition of flux leads to a simple and important connection between 
flux and flow speed. Imagine a tube of stuff (for example, molecules) with 
cross-sectional area A. The stuff flows through the tube at a speed v. 

In a time t, how much stuff leaves the tube? 



In the time t, the stuff in the shaded chunk, 
spanning a length vt, leaves the tube. This 
chunk has volume Avt . The amount of stuff 
in that volume is 
stuff 



volume 



x Avt. 



(3.38) 



density of stuff volume 







The amount of stuff per volume, the density of stuff, occurs so often that 
it usually gets a special symbol. When the stuff is particles, the density is 
labeled n for number density (in contrast to N for the number itself). When 
the stuff is charge or mass, the density is labeled p. 



From the amount of stuff, we can find the flux: 



flux of stuff = 



amount of stuff 
area x time 



Avt 

density of stuff x volume 
area x time 

At 



(3.39) 



The product At cancels, leaving the general relation 

flux of stuff = density of stuff x flow speed. (3.40) 

As a particular example, when the stuff is charge (Problem 3.28), the flux of 
stuff becomes charge per time per area, which is current per area or current 
density. With that label for the flux, the general relation becomes 

current density = charge density x flow speed , (3.41) 

7 } ^77 

where u drift is the flow speed of the charge — which you will estimate in 
Problem 6.16 for electrons in a wire. 
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The general relation will be crucial in estimating the power required to fly 
(Section 3.6) and in understanding heat conduction (Section 7.4.2). 



3.4.3 Average solar flux 

An important flux is energy flux: the rate at which energy passes through 
a surface, divided by the area of the surface. Here, rate means energy per 
time, or power. Therefore, energy flux is power per area. An energy flux 
essential to life is the solar flux: the solar power per area falling on Earth. 
This flux drives most of our weather. At the top of the atmosphere, looking 
directly toward the sun, the flux is roughly F = 1300 watts per square meter. 

However, this flux is not evenly distributed over the surface of the earth. 
The simplest reason is night and day. On the night side of the Earth, the 
solar flux is zero. More subtly, different latitudes have different solar fluxes: 
The equatorial regions are warmer than the poles because they receive more 
solar flux than the poles do. 

What is the solar flux averaged over the whole Earth? 

We can find the average flux using a box model (a sunlight 
conservation argument). Here is sunlight coming 
to the Earth (with parallel rays, because the Sun is 
so far away). Hold a disk with radius R Earth perpen- 
dicular to the sunlight so that it blocks all sunlight that the Earth otherwise 
would get. The disk absorbs a power that we can find from the energy flux: 

power = energy flux x area = FnR^ : . h , (3.42) 




where F is the solar flux. Now spread this power over the whole Earth, 
which has surface area 47T Aj; arth : 



power 

average flux = 

surface area 



FnR 



2 

Earth 



^^Earth 



F 

4' 



(3.43) 



Because one-half of the Earth is in night, averaging over the night and day- 
light parts of the earth accounts for a factor of 2. Therefore, averaging over 
latitudes must account for another factor of 2 (Problem 3.29). 

Problem 3.29 Averaging solar flux over all latitudes 

Integrate the solar flux over the whole sunny side of the Earth, accounting for the 
varying angles between the incident sunlight and the surface. Check that the result 
agrees with the result of the box model. 
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The result is roughly 325 watts per square meter. This average flux slightly 
overestimates what the Earth receives at ground level, because not all of the 
1300 watts per square meter hitting the top of the atmosphere reaches the 
surface. Roughly 30 percent gets reflected near the top of the atmosphere 
(by clouds). The surviving amount is about 1000 watts per square meter. 
Averaged over the surface of the Earth, it becomes 250 watts per square me- 
ter (which then goes into the surface and the atmosphere), or approximately 
F/5, where F is the flux at the top of the atmosphere. 



3.4.4 Rainfall 



These 250 watts per square meter determine characteristics of our weather 
that are essential to life: the average surface temperature and the average 
rainfall. You get to estimate the surface temperature in Problem 5.43, once 
you learn the reasoning tool of dimensional analysis. Here, we will estimate 
the average rainfall. 



evaporation 


water in 


rainfall 




atmosphere 





If the box representing the atmosphere holds a 
fixed amount of water — and over a long timescale, 
the amount is constant (it is our invariant) — then 
what goes into the box must come out of the box. 

The inflow is evaporation; the outflow is rain. Therefore, to estimate the 
rainfall, estimate the evaporation — which is produced by the solar flux. 



► 



How much rainfalls on Earth? 



Rainfall is measured as a height of water per time — typically, inches or mil- 
limeters per year. To estimate global average rainfall, convert the supply of 
solar energy to the supply of rainwater. In other words, convert power per 
area to height per time. The structure of the conversion is 
power ? height 

X - = — 2— , (3.44) 

area ? time 



where ?/? represents the conversion factor that we need to determine. To 
find what this conversion factor represents, we multiply both sides by area 
per power. The result is 

? area x height volume 

- = = . (3.45) 

? power x time energy 



What physical quantity could this volume per energy be? 
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We are trying to determine the amount of rain, so the volume in the numer- 
ator must be the volume of rain. Evaporating the water requires energy, so 
the energy in the denominator must be the energy required to evaporate 
that much water. The conversion factor is then the reciprocal of the heat of 
vaporization of water L vap , but expressed as an energy per volume. In Sec- 
tion 1.7.3, we estimated L vap as an energy per mass. To make it an energy 
per volume, just multiply by a mass per volume — namely, by p water : 



energy mass' energy 

X = . (3.46) 

mass' volume volume 



bap ^ water iVvaterk-ap 

Our conversion factor, volume per energy, is the reciprocal, l/p water L 
Our estimate for the average rainfall then becomes 



vap* 



solar flux going to evaporate water 



Pwater^- 1 ' 



vap 



(3.47) 



For the numerator, we cannot just use F, the full solar flux at the top of the 
atmosphere. Rather, the numerator incorporates several dimensionless ra- 
tios that account for the hoops through which sunlight must jump in order 
to reach the surface and evaporate water: 



0.25 averaging the intercepted flux over the whole surface of the Earth 
(Section 3.4.3) 

0.7 the fraction not reflected at the top of the atmosphere 
0.7 of the sunlight not reflected, the fraction reaching the surface (the 
other 30 percent is absorbed in the atmosphere) 
x 0.7 of the sunlight reaching the surface, the fraction reaching the oceans 
(the other 30 percent mostly warms land) 

= 0.09 fraction of full flux F that evaporates water (including averaging the 
full flux over the whole surface) 



The product of these four factors is roughly 9 percent. With L vap = 2.2 x 
10 6 joules per kilogram (which we estimated in Section 1.7.3), our rainfall 
estimate becomes roughly 

F fraction 



1300 Wm~ 2 x 0,09 _ 5.3xl0~ 8 m 

10 3 kgm -3 x 2.2xl0 6 Jkg -3 s 



(3.48) 
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The length in the numerator is tiny and hard to perceive. Therefore, the 
common time unit for rainfall is a year rather than a second. To convert the 
rainfall estimate to meters per year, multiply by 1: 

5.3 x 10 -8 m 3xl0 7 ^ 1.6m 

X « (3.49) 

g 1 yr yr 

(about 64 inches per year). Not bad: Including all forms of falling water, 
such as snow, the world average is 0.99 meters per year — slightly higher 
over the oceans and slightly lower over land (where it is 0.72 meters per 
year). The moderate discrepancy between our estimate and the actual aver- 
age arises because some sunlight warms water without evaporating it. To 
reflect this effect, our table on page 81 needs one more fraction (« 2/3). 

Problem 3.30 Solar luminosity 

Estimate the solar luminosity — the power output of the Sun (say, in watts) — based 

on the solar flux at the top of the Earth's atmosphere. 

Problem 3.31 Total solar power falling on Earth 

Estimate the total solar power falling on the Earth's surface. How does it compare 

to the world energy consumption? 

Problem 3.32 Explaining the difference between ocean and land rainfall 

Why is the average rainfall over land lower than over the ocean? 



3.4.5 Residence times 

Because of evaporation, the atmosphere contains a lot of water: roughly 
1.3 x 10 16 kilograms — as vapor, liquid, and solid. This mass tells us the res- 
idence time: how long a water molecule remains in the atmosphere before 
it falls back to the Earth as precipitation (the overall name for rain, snow, or 
hail). The estimate will illustrate a new way to use box models. 

Here is the box representing the water in the atmosphere (assumed to need 
only one box). The box is filled by evaporation and emptied by rainfall. 



evaporation 


^ water 


rainfall 




in atmosphere 





Imagine that the box is a water hose holding a mass m wa ter . How long does 
a water molecule take to get from one end of the hose to other? This time 
is the average time taken by a water molecule from evaporation until its 
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return to the Earth as precipitation. In the box model, the time is the time 
to completely fill the box. This time constant, denoted r, is 
mass of water in the atmosphere 

T = H ^ . (3.50) 

rate ot mflow or outflow, as a mass per time 

The numerator is m water . For the denominator, we convert rainfall, which 
is a speed (for example, in meters per year), to a mass flow rate (mass per 
time). Let's name the rainfall speed ^rainfall- The corresponding mass flux 
is, using our results from Section 3.4.2, p wa ter ^rainfall : 

mass flux = density x flow speed = p wa terAainfaii- (3.51 ) 

water ^rainfall 

Flux is flow per area, so we multiply mass flux by the Earth's surface area 
A i a rth t° get the mass flow: 

maSS flow Mwater ^rainfall ^ Earth * (3.52) 



At this rate, the fill time is 

' ' ^water 



T = 



j^water^rainfall -^Earth 



(3.53) 



There are two ways to evaluate this time: the direct but less insightful 
method, and the less direct but more insightful method. Let's first do the 
direct method, so that we at least have an estimate for r: 



1.3 x 10 16 kg 

10 3 kg m -3 x 1 myr -1 x 47T x (6 x 10 6 m) 2 



2.5 x 10 2 yr. 



(3.54) 



which is roughly 10 days. Therefore, after evaporating, water remains in 
the atmosphere for roughly 10 days. 



For the less direct but more insightful method, notice which quantities are 
not reasonably sized — that is, not graspable by our minds — namely, m water 
and A Earth . But the combination m watcr / p watQr A Earth is reasonably sized: 



m 



water 



1.3 x 10 16 kg 



Mwater ^Earth 10 3 kgm“ 3 X 47T X (6 X 10 6 m) 2 



2.5 x 10 -2 m. 



(3.55) 



This length, 2.5 centimeters, has a physical interpretation. If all water, snow, 
and vapor fell out of the atmosphere to the surface of the Earth, it would 
form an additional global ocean 2.5 centimeters deep. 

Rainfall takes away 100 centimeters per year. Therefore, draining this ocean, 
with a 2.5-centimeter depth, requires 2.5 x 10 -2 years or about 10 days. This 
time is, once again, the residence time of water in the atmosphere. 
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3.5 Drag using conservation of energy 

A box model will next help us estimate drag forces. Drag, one of the most 
difficult subjects in physics, is also one of the most important forces in every- 
day life. If it weren't for drag, bicycling, flying, and driving would be a 
breeze. Because of drag, locomotion requires energy. Rigorously calculat- 
ing a drag force requires solving the Navier-Stokes equations: 

9v 1 o 

(v-V)v + — = — Vp + W 2 v. (3.56) 

df p 

They are coupled, nonlinear, partial-differential equations. You could read 
many volumes describing the mathematics to solve these equations. Even 
then, solutions are known only in a few circumstances — for example, a 
sphere moving slowly in a viscous fluid or moving at any speed in a non- 
viscous fluid. However, a nonviscous fluid — what Feynman [14, Section 
II-40-2], quoting John von Neumann, rightly disparages as "dry water" — is 
particularly irrelevant to real life because viscosity is the cause of drag, so a 
zero-viscosity solution predicts zero drag! Using a box model and conser- 
vation of energy is a simple and insightful alternative. 



3.5.1 Box model for drag 

We will first estimate the energy lost to drag as an ob- 
ject moves through a fluid, as in Section 3.2.1. From the 
energy, we will find the drag force. To quantify the the 
problem, imagine pushing an object of cross-sectional 
area A cs at speed v for a distance d. The object sweeps < 

(The tube length d is arbitrary, but it will cancel out of tl 

How much energy is consumed by drag? 

Energy is consumed because the object gives kinetic energy to the fluid 
(say, water or air); viscosity, as we will model in Section 6.4.4, then turns 
this energy into heat. The kinetic energy depends on the mass of the fluid 
and on the speed it is given. The mass of fluid in the tube is pA cs d, where 
p is the fluid density. The speed imparted to the fluid is roughly the speed 
of the object, which is v. Therefore, the kinetic energy given to the fluid is 
roughly pA cs v 2 d : 

^kinetic ~ P A cs d XV 2 = pA cs V 2 d. (3.57) 




it a tube of fluid, 
force.) 



mass 
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This calculation ignores the factor of one-half in the definition of kinetic 
energy. However, the other approximations, such as assuming that only 
the swept-out fluid is affected or that all the swept-out fluid gets speed v, 
are at least as inaccurate. For this rough calculation, there is little point in 
including the factor of one-half. 

This kinetic energy is roughly the energy converted into heat. Therefore, 
the energy lost to drag is roughly pA cs v 2 d. The drag force is then given by 

energy lost to drag = drag force x distance . (3.58) 

~pA a V Z d htag d 

Now we can solve for the drag force: 

fdrag ~ PA csV 2 - (3-59) 

As expected, the arbitrary distance d has canceled out. 

3.5.2 Testing the analysis with a home experiment 

To test this analysis, try the following home experiment. Photocopy or 
print this page at 200 percent enlargement (a factor of 2 larger in width 
and height), cut out the template, and tape the two straight edges together 
to make a cone: 



We could use many other shapes. However, a cone is easy to construct, and 
also falls without swishing back and forth (as a sheet of paper would) or 
flipping over (as long as you drop it point down). 

We'll test the analysis by predicting the cone's terminal speed: that is, 
its steady speed while falling. When the cone is falling at this constant 
speed, its acceleration is zero, so the net force on it is, by Newton's second 
law, also zero. Thus, the drag force F drag equals the cone's weight mg 
(where m is the cone's mass and g is the gravitational acceleration): 

p ai yA cs ~ mg. 





(3.60) 



(3.61) 
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The terminal speed thus reveals the drag force. (Even though the drag force 
equals the weight, the left side is only an approximation to the drag force, 
so we connect the left and right sides with a single approximation sign ~.) 
The terminal speed zi term is then 

I mg 

Herm ~ ■ 0-62) 

V ^csr air 

The mass of the cone is 



m = A paper x areal density of paper. 

^paper 



(3.63) 



Here, A paper is the area of the cone template; and the areal density (7 paper , 
named in analogy to the regular (volume) density, is the mass per area of 
paper. Although areal density seems like a strange quantity to define, it is 
used worldwide to describe the "weight" of different papers. 



The quotient m/A cs contains the ratio A paper /A cs . Rather than estimating 
both areas and finding their ratio, let's estimate the ratio directly. 

How does the cross-sectional area A cs compare to the area of the paper? 

Because the cone's circumference is three-quarters of the circum- 
ference of the full circle, its cross-sectional radius is three-quar- 
ters of the radius r of the template circle. Therefore, 




Because the template is three-quarters of a full circle, 
A = — 77^2 

-^paper 4 1 L ' * 



(3.65) 



The paper area has one factor of three-quarters, whereas the cross-sectional 
area has two factors of three-quarters, so A paper /A cs = 4/3. Now u term sim- 
plifies as follows: 



1/2 



1/2 



y term 



A ry 

^paper u paper 



x g 



-<J. 



paper 



8 



(3.66) 



The only unfamiliar number is the areal density cr aper , the mass per area 
of paper. Fortunately, areal density is used commercially, so most reams of 
printer paper state their areal density: typically, 80 grams per square meter. 




3.5 Drag using conservation of energy 



87 



► 



Is this cr paper consistent with the estimates for a dollar bill in Section 1.1? 



There we estimated that the thickness t of a dollar bill, or of paper in general, 
is approximately 0.01 centimeters. The regular (volume) density p would 
then be 0.8 grams per cubic centimeter: 



c, 



-2 



1 nr 



^paper 



paper 80 gm 

t 10 -2 cm 10 4 cm 2 



0.8 



£ 

cm 3 



( 3 . 67 ) 



This density, slightly below the density of water, is a good guess for the 
density of paper, which originates as wood (which barely floats on water). 
Therefore, our estimate in Section 1.1 is consistent with the proposed areal 
density of 80 grams per square meter. 



After putting in the constants, the cone's terminal speed is predicted to be 
roughly 0.9 meters per second: 



1/2 



V, 



term 



4 8xl0 2 kgm 2 xl0ms 
3 1.2 kg m -3 



0.9 ms 



-l 



( 3 . 68 ) 



To test the prediction and, with it, the analysis justifying it, I held the cone 
slightly above my head, from about 2 meters high. After I let the cone go, 
it fell for almost exactly 2 seconds before it hit the ground — for a speed of 
roughly 1 meter per second, very close to the prediction. Box models and 
conservation triumph again! 



3.5.3 Cycling 

In introducing the analysis of drag, I said that drag is one of the most impor- 
tant physical effects in everyday life. Our analysis of drag will now help us 
understand the physics of a fantastically efficient form of locomotion — cy- 
cling (for its efficiency, see Problem 3.34). 

What is the world-record cycling speed? 

The first task is to define the kind of world record. Let's analyze cycling on 
level ground using a regular bicycle, even though faster speeds are possible 
riding downhill or on special bicycles. In bicycling, energy goes into rolling 
resistance, friction in the chain and gears, and air drag. The importance of 
drag rises rapidly with speed, due to the factor of v 2 in the drag force, so at 
high-enough speeds drag is the dominant consumer of energy. 
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Therefore, let's simplify the analysis by assuming that drag is the only con- 
sumer of energy. At the maximum cycling speed, the power consumed by 
drag equals the maximum power that the rider can supply. The problem 
therefore divides into two estimates: the power consumed by drag (P drag ) 
and the power that an athlete can supply (P at hi e te)- 

Power is force times velocity: 

energy force x distance , , ., , 

power = = = force x velocity. (3.69) 

time time 



Therefore, 



rag Pdrag^max 



Setting P drag = P, 



athlete 



1 athlete 

Pair -^cs 



~ P^ A cs- 

allows us to solve for the maximum speed: 

1/3 



(3.70) 



(3.71) 



where A cs is the cyclist's cross-sectional area. In Section 1.7.2, we estimated 
Pathiete as 300 watts. To estimate the cross-sectional area, divide it into a 
width and a height. The width is a body width — say, 0.4 meters. A racing 
cyclist crouches, so the height is roughly 1 meter rather than a full 2 meters. 
Then A cs is roughly 0.4 square meters. 



Plugging in the numbers gives 



300 W 



1 kgm 3 x 0.4 m 2 



1/3 



(3.72) 



That formula, with its mix of watts, meters, and seconds, looks suspicious. Are the 
units correct? 



Let's translate a watt stepwise into meters, kilograms, and seconds, using 
the definitions of a watt, joule, and newton: 



W= 



J = Nm, 



kgm 
N = -A— . 



(3.73) 



The three definitions are represented in the next divide-and-conquer tree, 
one definition at each nonleaf node. Propagating the leaves toward the root 
gives us the following expression for the watt in terms of meters, kilograms, 
and seconds (the fundamental units in the SI system): 
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The units in v max become 
/ w v L 3 
iqpn^s -3 
.kg' m -3 x mt 2 ' 




( 3 . 75 ) 



The kilograms cancel, as do the square meters. The 
cube root then contains only meters cubed over sec- 
onds cubed; therefore, the units for v max are meters per 
second. 




Let's estimate how many meters per second. Don't let 
the cube root frighten you into using a calculator. We 
can do the arithmetic mentally, if we massage (adjust) k 8 
the numbers slightly If only the power were 400 watts 
(or instead the area were 0.3 square meters)! Instead of wishing, make it 
so — and don't worry about the loss of accuracy: Because we have neglected 
the drag coefficient, our speed will be approximate anyway. Then the cube 
root becomes an easy calculation: 




300:400 W 
1 kgm -3 x 0.4 m 2 



In more familiar units, the record speed is 22 miles per hour or 36 kilome- 
ters per hour. As a comparison, the world 1-hour record — cycling as far as 
possible in 1 hour — is 49.7 kilometers or 30.9 miles, set in 2005 by Ondrej 
Sosenka. Our prediction, based on the conservation analysis of drag, is 
roughly 70 percent of the actual value. 



How can such an estimate be considered useful? 



High accuracy often requires analyzing and tracking many physical effects. 
The calculations and bookkeeping can easily obscure the most important 
effect and its core idea, costing us insight and understanding. Therefore, 
almost everywhere in this book, the goal is an estimate within a factor of 2 
or 3. That level of agreement is usually enough to convince us that our 
model contains the situation's essential features. 

Here, our predicted speed is only 30 percent lower than the actual value, so 
our model of the energy cost of cycling must be broadly correct. Its main 
error arises from the factor of one-half that we ignored when estimating the 
drag force — as you can check by doing Problem 3.33. 
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3.5.4 Fuel efficiency of automobiles 

Bicycles, in many places, are overshadowed by cars. From the analysis of 
drag, we can estimate the fuel consumption of a car (at highway speeds). 
Most of the world measures fuel consumption in liters of fuel per 100 kilo- 
meters of driving. The United States uses the reciprocal quantity, fuel effi- 
ciency — distance per volume of fuel — measured in miles per US gallon. To 
develop unit flexibility, we'll do the calculation using both systems. 

For a bicycle, we compared powers: the power consumed by drag with the 
power supplied by an athlete. For a car, we are interested in the fuel con- 
sumption, which is related to the energy contained in the fuel. Therefore, 
we need to compare energies. For cars traveling at highway speeds, most 
of the energy is consumed fighting drag. Therefore, the energy consumed 
by drag equals the energy supplied by the fuel. 

Driving a distance d, which will be 100 kilometers, consumes an energy 

^drag ~ P&iiP ^cs ^ • (3.77) 



The fuel provides an energy 

Efuei ~ ^energy density x fuel mass = £fuei Ffud^fuei- 

Because E fuel 






j^fuel 7f L: 



E drag , ^e volume of fuel required is given by 



V, 



fuel 



V 2 A c 



Pi uel^fuel j^fuel ^fuel 



-'drag 



d. 



^consumption 



(3.78) 



(3.79) 



Because the left-hand side, U fuel , is a volume, the complicated factor in front 
of the travel distance d must be an area. Let's make an abstraction by nam- 
ing this area. Because it is proportional to fuel consumption, a self-docu- 
menting name is A consumption . Now let's estimate the quantities in it. 

1. Density ratio p a i r /Pf ue i ■ The density of gasoline is similar to the density of 
water, so the density ratio is roughly 10 j 

2. Speed v. A highway speed is roughly 100 kilometers per hour (60 miles 
per hour) or 30 meters per second. (A useful approximation for Ameri- 
cans is that 1 meter per second is roughly 2 miles per hour.) 

3. Energy density £r uel . We estimated this quantity Section 2.1 as roughly 
10 kilocalories per gram or 40 megajoules per kilogram. 
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4. Cross-sectional area A cs . A car's cross section is about 2 me- 
ters across by 1.5 meters high, so A cs ~ 3 square meters. 

With these values. 




1.5m 



A, 



consumption 



io- 



10 3 m 2 s 2 x 3m 2 
4 x 10 7 Jkg -1 



8 x 10 -8 m 2 . 



2m 



(3.80) 



To find the fuel consumption, which is the volume of fuel per 100 kilometers 
of driving, simply multiply A consumption by d = 100 kilometers or 10 5 meters, 
and then convert to liters to get 8 liters per 100 kilometers: 



V, 



fuel 



8 x 10 8 jn 2 ' x lO 5 ^ 



10 3 € 
pm 3 ' 



8t. 



1 consumption 



(3.81) 



For the fuel efficiency, we use A consumption in the form d = V' fuel /A consumption 
to find the distance traveled on 1 gallon of fuel, converting the gallon to 
cubic meters: 



d 



Wei 

1 safkrrT AJt 10~ 3 m 3 

— — x x = 5 x 10 4 m. 

8 x 10 _8 .m > 1 gahorf L t 

A 

-^consumption 



(3.82) 



The struck-through exponent of 3 in the m indicates that the cubic me- 
ters became linear meters, as a result of cancellation with the m 2 in the 
/'consumption- The resulting distance is 50 kilometers or 30 miles. The pre- 
dicted fuel efficiency is thus roughly 30 miles per gallon. 

This prediction is very close to the official values. For example, for new mid- 
size American cars (in 2013), fuel efficiencies of nonelectric vehicles range 
from 16 to 43 miles per gallon, with a mean and median of 30 miles per 
gallon (7.8 liters per 100 kilometers). 

The fuel-efficiency and fuel-consumption predictions are far more accurate 
than we deserve, given the many approximations! For example, we ignored 
all energy losses except for drag. We also used a very rough drag force 
p air u 2 A cs , derived from a reasonable but crude conservation argument. Yet, 
like Pippi Longstocking, we came out right anyway. 



What went right? 
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The analysis neglects two important factors, so such accuracy is possible 
only if these factors cancel. The first factor is the dimensionless constant 
hidden in the single approximation sign of the drag force: 

^drag ~ £air^cs^ • (3.83) 



Including the dimensionless prefactor (shown in gray), the drag force is 



p _ 

1 drag 



l 



hi £air^cs^ ' 



(3.84) 



where c d is the drag coefficient (introduced in Section 3.2.1). The factor of 
one-half comes from the one-half in the definition of kinetic energy. The 
drag coefficient is the remaining adjustment, and its origin is the subject of 
Section 5.3.2. For now, we need to know only that, for a typical car, c d « 1 /2. 
Therefore, the dimensionless prefactor hidden in the single approximation 
sign is approximately 1 /4. 



Based on this more accurate drag force, will cars use more or less than 8 liters of 
fuel per 100 kilometers? 



Including the c d /2 reduces the drag force and the fuel consumption by a fac- 
tor of 4. Therefore, cars would travel 120 miles on 1 gallon of fuel or would 
consume only 2 liters per 100 kilometers. This more careful prediction is 
far too optimistic — and far worse than the original, simpler estimate. 

What other effect did we neglect? 



The engine efficiency — a typical combustion engine, whether gasoline or 
human, is only about 25 percent efficient: An engine extracts only one-quar- 
ter of the combustion energy in the fuel; the remaining three-quarters turns 
into heat without doing mechanical work. Including this factor increases 
our estimate of the fuel consumption by a factor of 4. 



The engine efficiency and the more accurate drag force together give the 
following estimate of the fuel consumption, with the new effect in gray: 



^fue! 



2^d ^ PaiyV ^cs ^ 

0-25 ft uel £f uel 



(3.85) 



The 0.25 in the denominator, from the engine efficiency, cancels the \c d in 
the numerator. That is why our carefree estimate, which neglected both fac- 
tors, was so accurate. The moral, which I intend only half jokingly: Neglect 
many factors, so that the errors can cancel one another out. 
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Problem 3.33 Adjusting the cycling record 

Our estimate of the world 1-hour record as roughly 35 kilometers (Section 3.5.3) 
ignored the drag coefficient. For a bicyclist, c d ~ 1. Will including the drag coeffi- 
cient improve or worsen the prediction in comparison with the actual world record 
(roughly 50 kilometers)? Answer that question before making the new prediction! 
What is the revised prediction? 

Problem 3.34 Bicyclist fuel efficiency 

What is the fuel consumption and efficiency of a bicyclist powered by peanut but- 
ter? Express your estimate as an efficiency (miles per gallon of peanut butter) and 
a consumption (liters of peanut butter per 100 kilometers). How does a bicycle 
compare with a car? 



3.6 Lift using conservation of momentum 

If drag is a drag, our next force, which is the companion to drag, should lift 
our spirits. Using conservation and box models, we will estimate the power 
required to generate lift. There are two main cases: hovering flight — for 
example, a hummingbird — and forward flight. Compared to forward flight, 
hovering flight has one fewer parameter (there is no forward velocity), so 
let's begin with its analysis, for a bird of mass m . 

3.6.1 Hovering: Hummingbirds 

How much power does a hummingbird require to hover? 

Hovering demands power because a hummingbird has weight: 

The Earth, via the gravitational field, supplies the hummingbird 
with downward momentum. The Earth therefore loses downward 
momentum or, equivalently, acquires upward momentum. (Thus, 
the Earth accelerates upward toward the hummingbird, although 
very, very slowly.) This flow of momentum can be tracked with 
a box model. Let's draw the box around the Earth-hummingbird 
system and imagine the system as the whole universe. The box contains 
a fixed (constant) amount of downward momentum, so the gravitational 
field can transfer downward momentum only within the box. In particular, 
the field transfers downward momentum from the Earth to the humming- 
bird. This picture is a fancy way of saying that the Earth exerts a downward 
force on the hummingbird, but the fancy way shows us what the hovering 
hummingbird must do to stay aloft. 



box 
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If the hummingbird keeps this downward momentum, it would accu- 
mulate downward speed — and crash to the ground. Fortunately, the 
box has one more constituent: the fluid (air). The hummingbird gives 
the downward momentum to the air: It flaps its wings and sends air 
downward. Lift, like drag, requires a fluid. (The air pushes down on 
the Earth, returning the downward momentum that the Earth lost 
via the gravitational field. Thus, the Earth does not accelerate.) 

How much power is required to send air downward? 

Power is force times speed. The force is the gravitational force mg that the 
hummingbird is unloading onto the air. Estimating the air's downward 
speed v z requires careful thought about the flow of momentum. The air car- 
ries the downward momentum supplied to the hummingbird. The 
momentum supply (a momentum rate or momentum per time) is 
the force mg: Force is simply momentum per time. Because mo- 
mentum flux is momentum per time per area, 

mg = momentum flux x area. (3.86) 

When we first studied flux, in Section 3.4.2, we derived that 

^ Pair^z 

flux of stuff = density of stuff x flow speed. (3.87) 

Because our stuff is momentum, this relation takes the particular form 

momentum flux = momentum density x flow speed. (3.88) 

Substituting this momentum flux into mg = momentum flux x area, 

mg = momentum density x flow speed x area. (3.89) 

Momentum density is momentum (m aiI v z ) per volume, so it is p a ir v z . The 
flow speed is v z . Thus, 

mg = p air v z x v z x area = p air u 2 x area. (3.90) 

To complete this equation, so that it gives us the downward velocity v z , we 
need to estimate the area. It is the area over which the hummingbird di- 
rects air downward. It is roughly L 2 , where L is the wingspan (wingtip to 
wingtip). Even though the wings do not fill that entire area, the relevant 
area is still L 2 , because the wings disturb air in a region whose size is com- 
parable to their longest dimension. (For this reason, high-efficiency planes, 
such as gliders, have very long wings.) 

Using L 2 as the estimate for the area, we get 




box 
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mg ~ Pai^L 2 , 

so the downward velocity is 




(3.91) 



(3.92) 



With this downward velocity and with the downward force mg, the power 
P (not to be confused with momentum!) is 



P = Fv z ~ mg 



mg 



r L 2 



(3.93) 



Let's estimate this power for an actual hummingbird: the Calliope hum- 
mingbird, the smallest bird in North America. Its two relevant characteris- 
tics are the following: 

wingspan L « 11 cm, 

(3.94) 

mass m « 2.5 g. 



As the first step in estimating the hovering power, we'll estimate the down- 
ward air speed using our formula for v z . The result is that, to stay aloft, the 
hummingbird sends air downward at roughly 1.3 meters per second: 



mg 



Vy 



2.5 x 10 -2 N 



1.2 kg m 3 x 1.2x10 2 m 2 



1/2 



1.3ms 



-l 



Air L 2 

The resulting power consumption is roughly 30 milliwatts: 
P~ 2.5 x 10 -2 N x 1.3ms- 1 « 3xlO“ 2 W . 



(3.95) 



(3.96) 



mg 



30 mW 



(Because animal metabolism, like a car engine, is only about 25 percent effi- 
cient, the hummingbird needs to eat food at a rate corresponding to 120 
milliwatts.) 

This power seems small: Even an (incandescent) flashlight bulb, for exam- 
ple, requires a few watts. However, as a power per mass, it looks more 
significant: 

3 x 10~ 2 W w iqW 
rn 2.5 x 10~ 3 kg kg 

In comparison, the world-champion cyclist Lance Armstrong, with one of 
the highest human power outputs, was measured to have a power output of 




96 



3 Symmetry and conservation 



7 watts per kilogram (Section 1.7.2). However, for a chemically unenhanced 
world-class athlete, 5 watts per kilogram is a more typical value. According 
to our estimates, hummingbird muscles should be twice as powerful as this 
world-class human value! Even for a small bird, hovering is hard work. 

Problem 3.35 Fueling hovering 

How much nectar must a hummingbird drink, as a fraction of its body mass, in 
order to hover for its working day (roughly 8 hours)? By mass, nectar is roughly 
50 percent sugar. 

Problem 3.36 Human hovering 

How much power would a person have to put out in order to hover by flapping 
his or her arms? 



.2 Lift in forward flight 

Now that we understand the fundamental mechanism 
of lift — discarding downward momentum by giving it 
to the air — we are ready to study forward flight: the 
flight of a migrating bird or of a plane. Forward flight 
is more complicated than hovering because forward 
flight has two velocities: the plane's forward velocity 
v and the downward component v z of the air's veloc- 
ity after passing around the wing. In forward flight, v z depends not only on 
the plane's weight and wingspan, but also on the plane's forward velocity. 

To stay aloft, the plane, like the hummingbird, must deflect air downward. 



wing 


T 

i 

i 




body 


L 




wing 


1 

1 

± 






The wing does this magic using complicated fluid mechanics, but we need 
not investigate it. All the gymnastics are hidden in the box. We need just 
the downward velocity v z required to keep the plane aloft, and the power 
required to give the air that much downward velocity. The power is, as 
with hovering, mgv z . However, the downward velocity v z is not the same 
as in hovering. 

It is determined by a slightly different momentum-flow diagram. It shows 
the air flow before and after it meets the wing. 
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V 

air 



Before the air reaches the wing (the left tube), the air has zero downward 
momentum. As in the analysis of hovering flight, the Earth supplies down- 
ward momentum to the plane, which passes it onto the air. This downward 
momentum is carried away by the air after the wing (the right tube). 

As with any flux, the rate of transfer of downward momentum is 

flux of downward momentum x area. (3.98) 




As in the analysis of the hummingbird, this rate must be mg, so that the 
plane stays aloft. The first factor, the flux of downward momentum, is 

density of downward momentum x flow speed. (3.99) 

Therefore, 

mg = density of downward momentum x flow speed x area. (3.100) 

As in the analysis of hovering, the density of downward momentum is pv z . 

In contrast to the analysis of hovering, where the stuff (downward momen- 
tum) is carried by the air moving downward, here the stuff is carried by air 
moving to the right. Thus, where the flow speed in hovering was the down- 
ward air speed v z , in forward flight the flow speed is the forward velocity v. 

As in the analysis of hovering, the relevant area 
is the squared wingspan L 2 , because the wings al- 
ter the airflow over a distance comparable to their 
longest dimension, which is their wingspan. You 
can see this effect in a NASA photograph of an 
airplane flying through a cloud of smoke. The gi- 
ant swirl, known as the wake vortex, has a diam- 
eter comparable to the plane's wingspan. Large 
planes can generate vortices that flip over small 
planes. Thus, when coming in for landing, planes 
must maintain enough separation to give these vortices time to dissipate. 
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With these estimates, the equation for v z becomes 

mg ~ Paiv V z x v X L 2 . 

transfer rate downward-momentum density flow speed area 

Now we can solve for the downward air speed: 

v. 



nig 



( 3 . 101 ) 



( 3 . 102 ) 



p ati vL 2 ' 

Now we can estimate the power required to generate lift in forward flight: 

(mg ) 2 



P = force x velocity 

A 



mg x 



mg 

PairVL 2 Pair^ 2 ' 



( 3 . 103 ) 



Here is a comparison of hovering and forward flight. 



hovering forward flight 



deflection area 

downward-momentum density 
flow speed 

downward-momentum flux 
downward-momentum flow mg 
downward velocity v z 
power to generate lift (mgv z ) 



L 2 


L 2 


Pair^z 


PairVz 


V z 


V 


Pai^ z 


PairV z V 


Pair^L 2 


PairV z vL 2 


V m g/PairL 2 


mg/PairVL 2 


mg^jmg/p air L 2 


( mg) 2 /p aiI vL 



In contrast to hovering, in forward flight the power contains the forward 
velocity in the denominator — a location that would produce nonsense for 
hovering, where the forward velocity is zero. 

As we did for hovering flight using the Calliope hummingbird, let's apply 
our knowledge of forward flight to an actual object. The object will be a 
Boeing 747-400 jumbo jet, and we will estimate the power that it requires in 
order to take off. A 747 has a wingspan L of approximately 60 meters, and 
a maximum takeoff mass m of approximately 4 x 10 5 kilograms (400 tons). 

We'll estimate the power in two steps: the weight mg and then the down- 
ward air speed v z . The weight is the easy step: It is just 4 x 10 6 newtons. 
The downward air speed v z is mg/p air vL 2 . The only unknown quantity is 
the takeoff speed v. You can estimate it by estimating the plane's acceler- 
ation n while taxiing on the runway and by estimating the duration of the 
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acceleration. When I last flew on a 747, 1 measured the acceleration by sus- 
pending my key chain from a string and estimating the angle 8 that it made 
with vertical (perpendicular to the ground). Then tan 8 = ci/g. For small 8, 
the relation simplifies to a/g ~ 8. I found 8 « 0.2, so the acceleration was 
about 0.2 g or 2 meters per second per second. This acceleration lasted for 
about 40 seconds, giving a takeoff speed of v « 80 meters per second (180 
miles per hour). 



The resulting downward speed v z is roughly 12 meters per second: 



mg 



4 x 10 6 N 



1.2 kgm 3 x 80ms 1 x 3.6xl0 3 m 2 

Pair V L 2 



12 ms 



( 3 . 104 ) 



Then the power required to generate lift is roughly 50 megawatts: 

P ~ mgv z « 4 x 10 6 N x 12 ms -1 « 5 x 10 7 W. ( 3 . 105 ) 



Let's see whether these estimates are reasonable. According to the plane's 
technical documentation, the 747-400's four engines together can provide 
roughly 1 meganewton of thrust. This thrust can accelerate the plane, with 
a mass of 4 x 10 5 kilograms, at 2.5 meters per second. This value is in good 
agreement with my estimate of 2 meters per second, made by suspending 
a key chain from a string and turning it into a plumb line. 

As another check: At takeoff, when v is roughly 80 meters per second, the 
meganewton of thrust corresponds to a power output Fv of 80 megawatts. 
This output is comparable to our estimate of 50 megawatts for the power 
to lift the plane off the ground. After liftoff, the engines use some of their 
power to lift the plane and some to accelerate the plane, because the plane 
still needs to reach its cruising speed of 250 meters per second. 

Symmetry and conservation make even fluid dynamics tractable. 



3.7 Summary and further problems 

In the midst of change, find what does not change — the invariant or con- 
served quantity. Finding these quantities simplifies problems: We focus 
on the few quantities that do not change rather than on the many ways in 
which quantities do change. An instance of this idea with wide application 
is a box model, where what goes in must come out. By choosing suitable 
boxes, we could estimate rainfalls and drag forces, and understand lift. 
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3 Symmetry and conservation 



Problem 3.37 Raindrop speed 

Use the drag force F drag ~ pA cs t’ 2 to estimate the terminal speed of a typical rain- 
drop with a diameter of 0.5 centimeters. How could you check the prediction? 

Problem 3.38 Average value of sin squared 

Use symmetry to find the average value of sin 2 f over the interval 

Problem 3.39 Moment of inertia of a spherical shell 

The moment of inertia of an object about an axis of rotation is 

Y m t df, (3.106) 

summed over all mass points i, where d, is the distance of the 
point from an axis of rotation. Use symmetry to find the moment 
of inertia of a spherical shell with mass m and radius r about an 
axis through its center. You shouldn't need to do any integrals! 

Problem 3.40 Flying bicyclist 

Estimate the wingspan a world-champion bicyclist would require in order to get 
enough lift for takeoff. 




Problem 3.41 Maximum-gain frequency for a second-order system 



In this problem, you use symmetry to maximize 
the gain of an LRC circuit or a spring-mass system 
with damping (using the analogy in Section 2.4.1). 
The gain G, which is the amplitude ratio V out / V in , 
depends on the signal's angular frequency co: 



G(w) 



yiu 

cv 0 



1 CO oP 

1+77 7 



(3.107) 



Q +’o 









Po, 



- ground 



where j = -V — 1, <u 0 is the natural frequency of the system, and Q, the quality factor, 
is a dimensionless measure of the damping. Don't worry about where the gain for- 
mula comes from: You can derive it using the impedance method (Problem 2.22), 
but the purpose of this problem is to maximize its magnitude |G(oi)|. Do so by 
finding a symmetry operation on u> that leaves |G(a>)| invariant. 



Problem 3.42 Runway length 

Estimate the runway length required by a 747 in order to take off. 



Problem 3.43 Hovering versus flying 

At what forward flight speed does the hummingbird of Section 3.6.1 require as 
much power to generate lift as it would to hover? How does this speed compare 
to its typical flight speed? 
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Problem 3.44 Resistive grid 

In an infinite grid of 1-ohm resistors, what is the resis- 
tance measured across one resistor? 

To measure resistance, an ohmmeter injects a current 
I at one terminal (for simplicity, imagine that 1 = 1 am- 
pere). It removes the same current from the other ter- 
minal, and measures the resulting voltage difference 
V between the terminals. The resistance is R = V /I. 

Hint: Use symmetry. But it's still a hard problem! 




Problem 3.45 Inertia tensor 

Here is an inertia tensor (the generalization of moment of inertia) of a particular 
object, calculated in an ill-chosen (but Cartesian) coordinate system: 

/ 4 0 0\ 

0 5 4 ( 3 . 108 ) 

\0 4 5/ 



a. Change the coordinate system to a set of principal axes, where the inertia tensor 
has the diagonal form 



I xx 0 0 \ 

0 Iyy 0 

0 0 I zz 



( 3 . 109 ) 



and give the principal moments of inertia I^, l yy , and I zz . Hint: Which proper- 
ties of a matrix are invariant when changing coordinate systems? 

b. Give an example of an object with a similar inertia tensor. Rhetorical question: 
In which coordinate system is it easier to think of such an object? 



This problem was inspired by a problem on the physics written qualifying exam 
during my days as a PhD student. The problem required diagonalizing an inertia 
tensor, and there was too little time to rederive or even apply the change-of-basis 
formulas. Time pressure sometimes pushes one toward better solutions! 



Problem 3.46 Temperature distribution on an infinite sheet 

On this infinite, uniform sheet, the x axis is held at zero tempera- 
ture, and the y axis is held at unit temperature ( T = 1). Find the 
temperature everywhere (except the origin!). Use Cartesian co- 
ordinates T(x,y ) or polar coordinates T(r, 6), whichever choice 
makes it easier to describe the temperature. 
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When there is change, look for what does not change. That principle, in- 
troduced when we studied symmetry and conservation (Chapter 3), is also 
the basis for our next tool, proportional reasoning. 



4.1 Population scaling 

An everyday example of proportional reasoning often happens when cook- 
ing for a dinner party. When I prepare fish curry, which I normally cook 
for our family of four, I buy 250 grams of fish. But today another family of 
four will join us. 

How much fish do I need? 

I need 500 grams. As a general relation, 

. , , new number of diners 

new amount = old amount x — . (4.1) 

usual number of diners 

Another way to state this relation is that the amount of fish is proportional 
to the number of diners. In symbols. 
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4 Proportional reasoning 



^fish x ^diners' 

where the ex symbol is read "is proportional to." 

But where in this analysis is the quantity that does not change? 



(4.2) 



Another way to write the proportionality relation is 

new amount of fish _ old amount of fish 
new number of diners old number of diners 

Thus, even when the number of diners changes, the quotient 

amount of fish 
number of diners 



does not change. 



For an analogous application of proportional reasoning, here's one way to 
estimate the number of gas stations in the United States. Following the 
principle of using human-sized numbers, which we discussed in Section 1 .4, 
I did not try to estimate this large number directly. Instead, I started with 
my small hometown of Summit, New Jersey. It had maybe 20 000 people 
and maybe five gas stations; the "maybe" indicates that these childhood 
memories may easily be a factor of 2 too small or too large. If the number 
of gas stations is proportional to the population (N stations cc N people ), then 



N us 

stations 



= N Summit: x 
stations 



N us , 

people 



3 x10 s 
2 x 10 4 ' 

at Summit 
people 



(4.5) 



The population ratio is roughly 15 000. Therefore, if Summit has five gas 
stations, the United States should have 75 000. We can check this estimate. 
The US Census Bureau has an article (from 2008) entitled "A Gas Station for 
Every 2,500 People"; its title already indicates that an estimate of roughly 
10 5 gas stations is reasonably accurate: Summit, in my reckoning, had 4000 
people per gas station. Indeed, the article gives the total as 116 855 gas 
stations — as close to the estimate as we can expect given the uncertainties 
in childhood memories! 



Problem 4.1 Homicide rates 

The US homicide rate (in 2011) was roughly 14000 per year. The UK rate in the 
same year was roughly 640. Which is the more dangerous country (per person), 
and by what factor? 
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4.2 Finding scaling exponents 

The dinner example (Section 4.1) used linear proportionality: When the 
number of dinner guests doubled, so did the amount of food. The relation 
between the quantities had the form y oc x or, more explicitly, y oc x 1 . The 
exponent, which here is 1, is called the scaling exponent. For that reason, 
proportionalities are often called scaling relations. Scaling exponents are a 
powerful abstraction: Once you know the scaling exponent, you usually do 
not care about the mechanism underlying it. 

4.2.1 Warmup 

After linear proportionality, the next simplest and most common 
type of proportionality is quadratic — a scaling exponent of 2 — and 
its close cousin, a scaling exponent of 1 /2. As an example, here is 
a big circle with diameter c/ big = y! 5 cm. 

What is the diameter of the circle with one-half the area of this circle? 

Let's first do the very common brute-force solution, which does not 
use proportional reasoning, so that you see what not to do. It begins 
with the area of the big circle: 

^bi g = f4g = ^Trcrn 2 . (4.6) 

The area of the small circle A small is A big /2, so A small = 57 T/ 8 cm 2 . ^small 
Therefore, the diameter of the small circle is given by 

= (4.7) 

Although this result is correct, by including n /4 and then dividing it out, 
we run around Robin Hood's barn (all of Sherwood forest) to reach a simple 
result. There must be a more elegant and insightful approach. 

This improved approach also starts with the relation between a circle's area 
and its diameter: A — Tid 2 /4. However, it discards the complexity early — in 
the next step — rather than carrying it through the analysis and having it 
vanish only at the end. An everyday analog of this approach is packing for 
a trip. Rather than dragging around books that you will not read or clothes 
that you will not wear, prune early and travel light: Pack only what you will 
use and set aside the rest. 





ho <r 
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4 Proportional reasoning 



To lighten your problem-solving luggage, observe that all circles, indepen- 
dent of their diameter, have the same prefactor 7T/4 connecting d 2 and A. 
Therefore, when we make a proportionality or scaling relation between A 
and d, we discard the prefactor. The result is the following quadratic pro- 
portionality (one where the scaling exponent is 2): 

A oc d 2 . (4.8) 



For finding the new diameter, we need the inverse scaling relation: 

d oc A l/2 . (4.9) 



In this form, the scaling exponent is 1 /2. This proportionality is shorthand 
for the ratio relation 

. 1/2 

(4.10) 



d, 



small 



d h 



A, 



small I 



Aig / 



. 5n 2 
^big = T cm 



A = -d 2 
4 



^big \ ^big 

The area ratio is 1 /2, so the diameter ratio is 1 / V 2 . Because the large diam- 
eter is 'J 5 cm, the small diameter is f5/2 cm. 

The proportional-reasoning solution is 
shorter than the brute-force approach, 
so it offers fewer places to go wrong. It 
is also more general: It shows that the 
result does not require that the shape 
be a circle. As long as the area of the 
shape is proportional to the square of 
its size (as a length) — a relation that 

holds for all planar shapes — the length ratio is 1 lf2 whenever the area ratio 
is 1/2. All that matters is the scaling exponent. 



Problem 4.2 Length of the horizontal bisecting path 

In Problem 3.23, about the shortest path that bisects an equilateral 
triangle, one candidate path is a horizontal line. How long is that 
line relative to a side of the triangle? 



dbig = V5 cm - 



(extra baggage) 

d — . 

d oc A 1 / 2 
prop, reasoning 



I 

Tmall = ^ 




Areas are connected to flux, because flux is rate per area. Thus, the scaling 
exponent for area — namely, 2 — appears in flux relationships. For example: 

What is the solar flux at Pluto's orbit? 



The solar flux F at a distance r from the Sun is the solar luminosity L Sun — the 
radiant power output of the sun — spread over a sphere with radius r: 
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I'Sun 

4nr 2 



(4.11) 



Even as r changes, the solar luminosity remains the same (conservation!), 
as does the factor of 4 tt. Therefore, in the spirit of packing light for a trip, 
simplify the equality F = L Sun /47rr 2 to the proportionality 

F oc r~ 2 (4.12) 



by discarding the factors L Sun and 4 tt. The scaling exponent here is —2: The 
minus sign indicates the inverse proportionality between flux and area, and 
the 2 is the scaling exponent connecting r to area. 

The scaling relation is shorthand for 

^"Pluto's orbit ( ^Pluto's orbit 

^"Earth's orbit V ^Earth's orbit 



(4.13) 



or 



Pluto's orbit 



Earth's orbit 



[ ^Pluto's orbit 
\ ^Earth's orbit 



-2 



(4.14) 



The ratio of orbital radii is roughly 40. Therefore, the solar flux at Pluto's 
orbit is roughly 40“ 2 or 1/1600 of the flux at the Earth's orbit. The resulting 
flux is roughly 0.8 watts per square meter: 



1300 W 



1 



0.8 W 



Pluto's orbit 



nv 6 - 



1600 



rm 



(4.15) 



sunlight 



energy 
at planet 
surface 



Receiving such a small amount of sunlight, Pluto must be very cold. We 
can estimate its surface temperature with a further proportionality. 

Surface temperature depends mostly on so-called 
blackbody radiation. The surface temperature is 
the temperature at which the radiated flux equals 
the incoming flux; we are making another box 

model. The radiated flux is given by the blackbody formula (which we 
will derive in Section 5.5.2) 

F = nT 4 , (4.16) 

where T is the temperature, and a is the Stefan-Boltzmann constant: 

W 



blackbody 



radiation 



cr 



5.7 x lo- 



rn 



2 K 4 



(4.17) 



What is the resulting surface temperature on Pluto? 
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4 Proportional reasoning 



As with any proportional-reasoning calculation, there is a long-winded, 
brute-force alternative (try Problem 4.5). The elegant approach directly 
uses the proportionalities 

T oc F 1 / 4 and F ex r -2 , (4.18) 

where r is the orbital radius. Together, they produce a new proportionality 

T « (r- 2 ) 1/4 = r~ 1/2 . (4.19) 

A compact graphical notation, similar to the divide-and-conquer trees, en- 
capsulates this derivation: 




T 



As indicated by the r — > F arrow, changing r changes F. The boxed number 
along the arrow gives the scaling exponent. Therefore, the r — > F arrow 
represents F cx r~ 2 . The F — > T arrow indicates that changing F changes T 
and, in particular, that T oc F 1/4 . 



To find the scaling exponent connecting r to T, multiply the scaling expo- 
nents along the path: 




(4.20) 



Problem 4.3 Explaining the graphical notation 

In our graphical representation of scaling relations, why is the final scaling expo- 
nent the product, rather than the sum, of the scaling exponents along the way? 

This scaling exponent represents the following comparison: 

^Earth ( ^Earth's orbit f ) Pluto's orbit 

^Pluto \ ^Pluto's orbit ) V ^Earth's orbit 



1/2 



(4.21) 



(The rightmost form, with the positive exponent, is more direct than the in- 
termediate form, because it does not first produce a fraction smaller than 1 
and then take its reciprocal with a negative exponent.) The ratio of orbital 
radii is 40, so the ratio of surface temperatures should be \f40 or roughly 6. 
Pluto's surface temperature should be roughly 50 K: 



1 Pluto 



^Earth 

6 



293 K 
6 



50 K. 



(4.22) 



Pluto's actual mean surface temperature is 44 K, very close to our prediction 
based on proportional reasoning. 
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Problem 4.4 Explaining the discrepancy 

Why is our prediction for Pluto's mean temperature slightly too high? 

Problem 4.5 Brute-force calculation of surface temperature 

To practice recognizing common but inferior problem-solving methods, use the 
brute- force method to estimate the surface temperature on Pluto: (a) From the so- 
lar flux at Pluto's orbit, calculate the solar flux averaged over the surface; (b) use 
that flux to estimate a blackbody temperature. 



4.2.2 Orbital periods 

In the preceding examples, the scaling relations formed chains (trees with- 
out branching): 



(4.23) 




surface temperature : r — 2 



More elaborate relationships also occur — as we will find in rederiving a 
famous law of planetary motion. 

How does a planet's orbital period depend on its orbital radius r? 



We'll study the special case of circular orbits (many planetary orbits are 
close to circular). Our exploratory thinking is often aided by making pro- 
portionality questions concrete. Therefore, rather than finding the scaling 
exponent using the abstract notion of "depend on," answer the doubling 
question: "When I double this quantity, what happens to that quantity?" 

What is so special about doubling? 



Doubling — multiplying by a factor of 2 — is the simplest useful change. A 
factor of 1 is simpler; however, being no change at all, it is too simple. 

What happens to the period if we double the orbital radius? 



The most direct effect of doubling the orbital radius is that gravity gets 
weaker. Because the gravitational force is an inverse-square force — that is, 
F oc — the gravitational force falls by a factor of 4. A compact and in- 

tuitive notation for these changes is to mark the change directly under the 
quantity: A notation of xn indicates multiplication by a factor of n. 
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4 Proportional reasoning 



Far 2 . 

x i x2 



(4.24) 



Because force is proportional to acceleration, the planet's acceleration n falls 
by the same factor of 4. 

In circular motion, acceleration and velocity are related by a = v 2 /r. (We 
will derive this relation in Section 5.1.1 and Section 6.3.4, with two different 
reasoning tools.) Therefore, the orbital velocity v is Jar, and doubling the 
radius increases the orbital speed by a factor of ^ 1/2: 

1 12 

v = (a x r ) . (4.25) 

xjl xi x2 

Although this calculation is correct, when it is stated as a factor of ^ 1/2 
it confounds our expectations and produces numerical whiplash. As we 
finish reading "increases by a factor of," we expect a number greater than 1. 
But we get a number smaller than 1. An increase by a factor smaller than 1 
is more simply described as a decrease. Therefore, it is more direct to say 
that the orbital speed falls by a factor of J2. 

The orbital period is T ~ r jv (the ~ contains the dimensionless prefactor 
2n), so it increases by a factor of 2’ 22 : 

T ~ r x v~ l . (4.26) 

x2 3 ' 2 x2 xj2 



In summary, doubling the orbital radius multiplies the orbital period by 
2 3/2 . In general, the connection between r and T is 

T ex r 3/2 . (4.27) 



This result is Kepler's third law for circular orbits. Our scaling analysis has 
the following graphical representation: 




In this structure, the new feature is that two paths reach the orbital veloc- 
ity v. There we add the incoming exponents. 
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1. r - + 1 — *- v represents the J7 in v = Jar. It carries +1 /2 powers of r. 

2. a - +\ —*■ v represents the (a in v = Jar. To determine how many 

powers of r flow through this arrow, follow the chain containing it: 

r 2 — *- F - +lj — >- - +1 — > v 

One power of r starts at the left side. It becomes —2 powers after pass- 
ing through the first scaling exponent and arriving at F. It remains —2 
powers on arrival at a . Finally, it becomes —1 power on arrival at v. This 
path therefore carries —1 power of r. 

Its contribution is the result of multiplying the three scaling exponents 
along the chain: 

1 

-2 X +1 X -- = -1. (4.28) 

r->F F->a a->v 

The —1 power carried by the three-link chain, representing r , combines 
with the +1 / 2 from the direct r — » v arrow, which represents r +1/2 . Adding 
the exponents on r, the result is that v contains —1/2 powers of r: 

v oc r -1 x r +1 / 2 = r -1 / 2 . (4.29) 

via F direct 

Let's practice the same reasoning by finding the scaling relation connecting 
T and r — which is Kepler's third law. The direct r —> T arrow, with a scaling 
exponent of +1, carries +1 powers of r. The v — > T arrow carries —1 powers 
of v. Because v contains —1/2 powers of r, the v -* T arrow carries +1/2 
powers of r: 

x -1 = +|- (4.30) 

r->v v->T 

Together, the two arrows contribute +3/2 powers of r and give us Kepler's 
third law: 

To c r +1 x r +1/2 = r +3/2 . (4.31) 

r->T viai; 

To summarize the exponent rules, for which we now have several illustra- 
tions: (1) Multiply exponents along a path, and (2) add exponents when 
paths meet. 

Now let's apply Kepler's third law to a nearby planet. 
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4 Proportional reasoning 



How long is the Martian year? 



The proportionality T <x r 3/2 is shorthand for the comparison 

^Vlars t ^Mars 

Uef V hef 




( 4 . 32 ) 



where r Mars is the orbital radius of Mars, r ref is the orbital radius of a refer- 
ence planet, and T ref is its orbital period. Because we are most familiar with 
the Earth, let's choose it as the reference planet. The reference period is 1 
(Earth) year; and the reference radius is 1 astronomical unit (AU), which is 
1.5 x 10 11 meters. The benefit of this choice is that we will obtain the period 
of Mars's orbit in the familiar unit of Earth years. 



The distance of Mars to the Sun varies between 2.07 x 10 11 meters (1.38 as- 
tronomical units) and 2.49 x 10 11 meters (1.67 astronomical units). Thus, 
the orbit of Mars is not very circular and has no single orbital radius r Mars . 
(Its significant deviation from circularity allowed Kepler to conclude that 
planets move in ellipses.) As a proxy for r Mars , let's use the average of the 
minimum and maximum radii. It is 1.52 astronomical units, making the 
ratio of orbital periods approximately 1.88: 



L Mars 
^Earth 



1.52 AU\ 
1 AU ) 



3/2 



1 . 88 . 



( 4 . 33 ) 



Therefore, the Martian year is 1.88 Earth years long. 

Problem 4.6 Brute-force calculation of the orbital period 

To emphasize the contrast between proportional reasoning and the brute-force ap- 
proach, find the period of Mars's orbit using the brute-force approach by starting 
with Newton's law of gravitation and then finding the orbital velocity and circum- 
ference. 

A surprising conclusion about orbits comes from our doubling question 
introduced on page 109. 



► 



What happens to the period of a planet if you double its mass? 



Using a type of thought experiment due to Galileo, imagine two identical 
planets, orbiting one just behind the other along the same orbital path. They 
have the same period. Now tie them together. The rope does not change 
the period, so the double-mass planet has same period as each individual 
planet: The scaling exponent is zero (T oc m°)\ 
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Problem 4.7 Pendulum period versus mass 

How does the period of an ideal pendulum depend on the mass of the bob? 



4.2.3 Projectile range 



In the previous examples, only one variable was 
independent; changing it changed all the others. 

However, many problems contain multiple inde- 
pendent variables. An example is the range R of 
a rock launched at an angle 8 with speed v. The 
traditional derivation uses calculus. You solve for the position of the rock 
as a function of time, solve for the time when its height is zero (the ground 
level), and then insert that time into the horizontal position to find the range. 
This analysis is not wrong, but its result still seems like magic. I leave un- 
satisfied, thinking, "The result must be true. But I still do not know why." 

That "why" insight comes from proportional reasoning, which discards the 
nonessential complexity. Let's begin with our doubling question. 




► 



How does doubling each of the independent variables affect the range? 



The independent variables include the launch velocity v, the gravitational 
acceleration g (because gravity returns the rock to Earth), and the launch 
angle. However, angles do not fit so easily into proportional reasoning, so 
we won't explore the role of 8 here (we will handle it in Section 8.2.2. 1 using 
the tool of easy cases, or you can try Problem 4.9). 

Because the only forces are vertical, the rock's horizontal velocity remains 
constant throughout its flight (an invariant!). Thus, the range is given by 

R = time aloft x initial horizontal velocity. (4.34) 

The time aloft is determined by the initial vertical velocity, because gravity 
steadily reduces it (at the rate g): 

. , initial vertical velocity 

time aloft (4.35) 

S 

Problem 4.8 Missing dimensionless prefactor 

What is the missing dimensionless prefactor in the preceding expression for the 
time that the rock stays aloft? 
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4 Proportional reasoning 



Now double the launch velocity v. That change doubles the initial horizon- 
tal and vertical components of the velocity. Doubling the vertical compo- 
nent doubles the time aloft. Because the range is proportional to the hori- 
zontal velocity and to the time aloft, when the launch velocity doubles, the 
range quadruples. The scaling exponent connecting v to R is 2: R oc y 2 . 

What is the effect of doubling g? 

Doubling g doesn't change the horizontal velocity or the initial vertical ve- 
locity, but it halves the time aloft and therefore the range as well. The scal- 
ing exponent connecting g to R is —1 : R oc . 

The combined scaling relation, which gives the dependence of R on both g 
and v, is 

v 2 

Roc (4.36) 

8 

Using v x and v y for the horizontal and vertical components of the launch 
velocity, the graphical representation of this reasoning is 




This graph shows a new feature: two independent variables, v and g. We'll 
need to track the powers of v and g separately. 

The range R has two incoming paths. The path via the horizontal velocity 
v x contributes +lx+l = +l powers of v but no powers of g. The path 
via t also contributes +1 powers of v, and contributes —1 powers of g. The 
diagram compactly represents how R became proportional to v 2 /g. 

The full range formula, including the launch angle 6 is 

v 2 

R sin 6 cos 9. (4.37) 

S 

The dependence on v and g is just as we predicted. 

The moral of this example is that you can derive and understand relations 
by ignoring constants of proportionality and instead concentrating on the 
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scaling exponents. Furthermore, you can use this ability in order to spot 
mistakes: Just check each independent variable's scaling exponent. For ex- 
ample, if someone proposes that projectile range R is proportional to v 3 /g, 
think, "The 1 /g makes sense from the time aloft. But what about the v 3 ? 
One power of v comes from the horizontal velocity and one power from 
the time aloft, which explains two powers of v. But where does the third 
power comes from? The range should instead contain v 2 /g." 

Problem 4.9 Angular factors in projectile range 

Explain the sin 6 and cos 6 factors by using the relation 

range = time aloft x horizontal velocity. (4.38) 



4.2.4 Planetary surface gravity 

Scaling, or proportional reasoning, connects independent to dependent vari- 
ables. Often we have freedom in choosing the independent variables. Here 
is an example to show you how to use that freedom. 

Assuming that planets are uniform spheres, how does g, the gravitational accelera- 
tion at the surface, depend on the planet's radius R? 

We seek the scaling exponent n iny oc R n . At the planet's surface, the grav- 
itational force F on an object of mass m is GMm / R 2 , where G is Newton's 
constant, and M is the planet's mass. The gravitational acceleration g is 
F /m or GM/R 2 . Because G is the same for all objects, we pack light and 
eliminate G to make the proportionality 
M 

g « £2- (4-39) 

In this form, with mass M and radius R as the independent variables, the 
scaling exponent n is —2. 

However, an alternative relation comes from noticing that the planet's mass 
M depends on the planet's radius R and density p as M ~ pR 3 . Then 

pR 3 

got — = pR. (4.40) 

In this form, with density and radius as the independent variables, the scal- 
ing exponent on R is now only 1. 

Which scaling relation, with mass or density, is preferable? 





116 



4 Proportional reasoning 



Planets vary widely in their mass: from 3.3 x 10 23 kilograms (Mercury) to 
1.9 x 10 27 kilograms (Jupiter), a range 4 decades wide (a factor of 10 4 ). They 
vary greatly in their radius: from 7 x 10 4 kilometers (Jupiter) down to 2.4 x 
10 3 kilometers (Mercury), a range of a factor of 30. The quotient M/R 2 has 
huge variations in the numerator and denominator that mostly oppose each 
other. When there is change, look for what does not change — or, at least, 
what does not change as much. In contrast to masses, planetary densities 
vary from only 0.7 grams per cubic centimeter (Saturn) to 5.5 grams per 
cubic centimeter (Earth) — a range of only a factor of 8. The variations in 
planetary surface gravity are easier to understand using the planet's radius 
and density rather than its radius and mass. 

This result is general. Mass is an extensive quantity: When two objects 
combine, their masses add. Density, in contrast, is an intensive quantity. 
Adding more of a particular substance does not change its density. Using 
intensive quantities for the independent variables usually leads to more 
insightful results than using extensive quantities does. 

Problem 4.10 Distance to the Moon 

The orbital period near the Earth's surface (say, for a low-flying satellite) is roughly 
1.5 hours. Use that information to estimate the distance to the Moon. 

Problem 4.11 Moon's angular diameter and radius 

On a night with a full moon, estimate the Moon's angular diameter — that is, the 
visual angle subtended by the Moon. Use that angle and the result of Problem 4.10 
to estimate the Moon's radius. 

Problem 4.12 Surface gravity on the Moon 

Assuming that all planets (and moons) have the same density, use the radius of the 
Moon (Problem 4.11) to estimate its surface gravity. Then compare your estimate 
with the actual value and suggest an explanation for the discrepancy. 

Problem 4.13 Gravitational strength inside a planet 

Imagine a uniform, spherical planet with radius R. How does the gravitational 
acceleration g depend on r , the distance from the center of the planet? Give the 
scaling exponent when r < R and when r >R, and sketch g(r). 

Problem 4.14 Making toast land butter-side up 

As a piece of toast slides off a dining table (starting with almost no horizontal 
velocity), it picks up angular velocity. Once it leaves the table, its angular velocity 
remains constant. In everyday experience, a toast usually backflips (rotates 180°) 
by the time it hits the ground, and lands butter-side down. How high would tables 
have to be for a piece of toast to land butter-side up? 
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4.3 Scaling exponents in fluid mechanics 

The preceding introductory examples may mislead you into thinking that 
proportional reasoning is useful only when we could also find the exact 
solution. As a counterexample, we return to that source of mathematical 
beauty but also misery, fluid mechanics, where exact solutions exist for 
hardly any situations of practical interest. To make progress, we need to 
discard complexity by focusing on the scaling exponents. 



4.3.1 Falling cones 

In Section 3.5.1, we used conservation reasoning to show that the drag force 
is given by 

^drag ~ PA C S V • (4.41) 

In Section 3.5.2, we tested this result by correctly predicting the terminal 
speed of a falling paper cone. However, that experiment concerned only 
one cone of a particular size. A natural generalization of that experiment is 
to predict a cone's terminal speed as a function of its size. 

How does the terminal speed of a paper cone depend on its size? 



Size is an ambiguous notion. It might refer to an area, a volume, or a length. 
Here, let's consider size to be the cone's cross-sectional radius r. The quan- 
titative question is to find the scaling exponent n in v ex r”, where v is the 
cone's terminal speed. 

The doubling question is, "What happens to the terminal speed when we 
double r?" At the terminal speed, drag and weight mg balance: 

PaiAs^ 2 ~ mg. (4.42) 

drag weight 



Therefore, the terminal speed is just what we found in Section 3.5.2: 



v 



/ 



mg 

Tair^cs 



(4.43) 



All cones feel the same g and p air , so we pack light and eliminate these 
variables to make the proportionality 




V OC 



(4.44) 
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Doubling r quadruples the amount of paper used to make the cone and 
therefore its mass m. It also quadruples its cross-sectional area A cs . Accord- 
ing to the proportionality, the two effects cancel: When r doubles, v should 
remain constant. All cones of the same shape (and made from the same 
paper) should fall at the same speed! 

This result always surprises me. So I tried the experiment. I printed the 
cone template in Section 3.5.2 at 400-percent magnification (a factor of 4 in- 
crease in length), cut it out, taped the two straight edges together, and raced 
the small and big cones by dropping them from a height of about 2 meters. 
After a roughly 2-second fall, they landed almost simultaneously — within 
0.1 seconds of each other. Thus, their terminal speeds are the same, give or 
take 5 percent. 

Proportional reasoning triumphs again! Surprisingly, the proportional-rea- 
soning result is much more accurate than the drag-force estimate p ail A cs v 2 
on which it is based. 

How can predictions based on proportional reasoning be more accurate than the 
original relations? 



To see how this happy situation arose, let's redo the calculation but include 
the dimensionless prefactor in the drag force. With the dimensionless pref- 
actor (shaded in gray), the drag force is 



t~drag y hi Fairies ^ ' 



(4.45) 



where c d is the drag coefficient (introduced in Section 3.2.1). The prefactor 
carries over to the terminal speed: 



v = 



m 




(4.46) 



Ignoring the prefactor decreases v by a factor of ^2/c d . (For nonstreamlined 
objects, c d ~ 1 , so the decrease is roughly by a factor of 42.) 



In contrast, in the ratio of terminal speeds Wig/Wmaiu the prefactor drops 
out. Here is the ratio with the prefactors shaded in gray: 
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As long as the drag coefficients c d are the same for the big and small cones, 
they divide out from the ratio. Then the scaling result — that the terminal 
speed is independent of size — is exact, independent of the drag coefficient. 

The moral, once again, is to build on what we know, rather than computing 
quantities that we do not need and that only clutter the analysis and thereby 
our thinking. Scaling relations bootstrap our knowledge. Here, if you know 
the terminal speed of the small cone, use that speed — and the scaling result 
that speed is independent of size — to find the terminal speed of the large 
cone. In the next section, we will apply this approach to estimate the fuel 
consumption of a plane. 

Problem 4.15 Taping the cones 

The big cone, with twice the radius of the small cone, has four times the weight 
of paper. But what about the tape? If you tape along the entire radius of the cone 
template, how does the length of the tape compare between the large and small 
cones? How should you apply the tape in order to maintain the 4 : 1 weight ratio? 

Problem 4.16 Bigger and bigger cones 

Further test the scaling relation v oc r° (terminal speed is independent of size) by 
building a huge cone from a template with four times the radius of the template 
for the small cone. Then race the small, big, and huge cones. 

Problem 4.17 Four cones versus one cone 
Use the cone template in Section 3.5.2 (at 200-percent enlarge- 
ment) to make five small cones. Then stack four of the cones 
to make one heavy small cone. How much faster will the 
four-cone stack fall in comparison to the single small cone? 

That is, predict the ratio of terminal speeds 

Pfour cones . (4 48) 

^one cone 

Then check your prediction by trying the experiment. 




4.3.2 Fuel consumption of a Boeing 747 jumbo jet 

In Section 3.5.4, we estimated the fuel consumption of automobiles. For the 
next example, we'll estimate the fuel consumption of a Boeing 747 jumbo 
jet. Rather than repeating the structure of the automobile estimate in Sec- 
tion 3.5.4 but with parameters for a plane — which would be the brute-force 
approach — we will reuse the automobile estimate and supplement it with 
proportional reasoning. 
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4 Proportional reasoning 



A plane uses its fuel to generate energy to fight drag and to generate lift. 
However, for this estimate, forget about lift. At a plane's cruising speed, 
lift and drag are comparable (as we will show in Section 4.6). Neglecting 
lift ignores only a factor of 2, because lift plus drag is twice the lift alone, 
and allows us to make a decent estimate without having to estimate the lift. 
Divide and conquer: Don't bite off all the complexity at once! 

The energy consumed fighting drag is proportional to the drag force: 

E <* PaiAs^- (4-49) 



This scaling relation is shorthand for the following comparison between a 
plane and a car: 




cruising altitude plane 

r air x cs 

0 sea level ^4 car 

r air cs 



X 




(4.50) 



Thus, the energy-consumption ratio breaks into three ratio estimates. 



What are reasonable estimates for the three ratios? 



1. Air density. A plane's cruising altitude is typically 35 000 feet or 10 kilo- 
meters, which is slightly above Mount Everest. At that height, mountain 
climbers require oxygen tanks, so the air, and oxygen, density must be 
significantly lower than it is at sea level. (Once you learn the reasoning 
tool of lumping, you can predict the density — see Problem 6.36.) The 
density ratio turns out to be roughly 3: 

cruising altitude 
P air ^ 1 

q sea level 3* 

< air 

The thinner air wins the plane a factor of 3 in fuel efficiency. 

2. Cross-sectional area. To estimate the cross-sectional area of the 
plane, we need to estimate the width and height of the plane's 
body; its wings are very streamlined, so they contribute negli- 
gible drag. When estimating lengths, let's make our measuring 
rod (our unit) a person length. Using this measuring rod has 
two benefits. First, the measuring rod is easy for our gut to pic- 
ture, because we feel our own size innately. Second, the lengths that we 
will measure will be only a small multiple of a person length. The nu- 
merical part of the quantity (for example, the 1.5 in "1.5 person lengths") 
will be comparable to 1, and therefore also easy to picture. As a rule of 



(4.51) 




1 \ ^ _i 

cross section 
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thumb, ratios between 1/3 and 3 are easy to picture and feel in our gut 
because our mental number hardware is exact for the quantities 1, 2, 
and 3. We choose our measuring rods accordingly. 

In applying the person-length measuring rod to the width of the plane's 
body, I remember the comfortable days of regulated air travel. As a 
child, I would find three adjacent empty seats in the back of the plane 
and sleep for the whole trip (which explains why my parents insist that 
traveling with small children was easy). A jumbo jet is three or four such 
seat groups across — call it three person lengths — and its cross section is 
roughly circular. A circle is roughly a square, so the cross-sectional area 
is roughly 10 square person lengths: 

(3 person lengths) 2 « 10 (person length) 2 . (4.52) 

Although we might worry that this estimate used too many approxima- 
tions, we should do it anyway: It gives us a rough-and-ready value and 
allows us to make progress. Our goal is edible jam today, not delicious 
jam tomorrow! 

Now let's estimate the cross-sectional area of a car. In standard 
units, it's about 3 square meters, as we estimated in Section 3.5.4. 

But let's apply our person-sized measuring rod. From noctur- 
nal activities in cars, you may have experienced that cars, un- 
comfortably, are about one person across. A car's cross section 
is roughly square, so the cross-sectional area is roughly 1 square 
person length. 

Problem 4.18 Comparing the cross-sectional areas 

How well does 1 square person length match 3 square meters? 

The ratio of cross-sectional areas is roughly 10; therefore, the plane's 
larger cross-sectional area costs it a factor of 10 in fuel efficiency. 

3. Speed. A reason to fly rather than drive is that planes travel faster than 
cars. A plane travels almost at the speed of sound: 1000 kilometers per 
hour or 600 miles per hour. A car travels at around 100 kilometers per 
hour or 60 miles per hour. The speed ratio is roughly a factor of 10, so 

f^V- 100. (4.53) 

V U car ) 

The plane's greater speed costs it a factor of 100 in fuel efficiency. 
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Now we can combine the three ratio estimates to estimate the ratio of energy 
consumptions: 




~ | x 10 x 100 « 300. 

density area (speed ) 2 



(4.54) 



A plane should be 300 times less fuel efficient than a car — terrible news for 
anyone who travels by plane! 



Should you therefore never fly again? 



Our conscience is saved because a plane carries roughly 300 passengers, 
whereas a typical (Western) car is driven by one person to and from work. 
Therefore, per person, a plane and a car should have comparable fuel effi- 
ciencies: 30 passenger miles per gallon, as we estimated in Section 3.5.4, or 
8 liters per 100 passenger kilometers. 

According to Boeing's technical data on the 747-400 model, the plane has a 
range of 13 450 kilometers, its fuel tank contains 216 840 liters, and it carries 
416 passengers. These data correspond to a fuel consumption of 4 liters per 
100 passenger kilometers: 

216 840 1 1 4 { 

x . (4.55) 

13 450 km 416 passengers 100 passenger km 

Our proportional-reasoning estimate of 8 liters per 100 passenger kilome- 
ters is very reasonable, considering the simplicity of the method compared 
to a full fluid-dynamics analysis. 

Problem 4.19 Density of air by proportional reasoning 

Make rough estimates of your own top swimming and cycling speeds, or use the 
data that a record 5-kilometer swim time is 56:16.6 (almost 1 hour). Thereby ex- 
plain why the density of water must be roughly 1000 times the density of air. Why 
is this estimate more accurate when based on cycling rather than running speeds? 

Problem 4.20 Raindrop speed versus size 

How does a raindrop's terminal speed depend on its radius? 

Problem 4.21 Estimating an air ticket price from fuel cost 

Estimate the fuel cost for a long-distance plane journey — for example, London to 
Boston, London to Cape Town, or Los Angeles to Sydney. How does the fuel cost 
compare to the ticket price? (Airlines do not pay many of the fuel taxes paid by 
ordinary motorists, so their fuel costs are lower than motorists' fuel costs.) 
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Problem 4.22 Relative fuel efficiency of a cycling 

Estimate the fuel efficiency, relative to a car, of a bicyclist powered by peanut but- 
ter traveling at a decent speed (say, 10 meters per second or 20 miles per hour). 
Compare your estimate with your estimate in Problem 3.34. 



4.4 Scaling exponents in mathematics 

Our scaling relations so far have connected physical quantities. But propor- 
tional reasoning can also bring us insight in mathematics. A classic exam- 
ple is the birthday paradox. 

How many people must be in a room before the probability that two people share a 
birthday (for example, two people are born on July 18) is at least 50 percent? 



Almost everyone, including me, reasons that, with 365 days in the year, we 
need roughly 50 percent of 365, or 183 people. Let's test this conjecture. The 
actual probability of having a shared birthday is 



1 - 





n — 1 \ 
365 ) ' 



(4.56) 



(A derivation is in Everyday Probability and Statistics [50, p. 49] or in the ad- 
vanced classic Probability Theory and its Application [13, vol. 1, p. 33]. But 
you can explain its structure already: Try Problem 4.23.) For n — 183, the 
probability is 1 — 4.8 x 10 -25 or almost exactly 1. 

Problem 4.23 Explaining the probability of a shared birthday 

Explain the pieces of the formula for the probability of a shared birthday: What is 
the reason for the 1 — in front of the product? Why does each factor in the product 
have a 1 — ? And why does the last factor have (n — l)/365 rather than m/365? 

To make this surprisingly high probability seem plausible, I gave random 
birthdays to 183 simulated people. Two people shared a birthday on 14 days 
(and we need only one such day); and three people shared a birthday on 
three days. According to this simulation and to the exact calculation, the 
plausible first guess of 183 people is far too high. This surprising result is 
the birthday paradox. 

Even though we could use the exact probability to find the threshold num- 
ber of people that makes the probability greater than 50 percent, we would 
still wonder why: Why is the plausible argument that n « 0.5 x 365 so badly 
wrong? That insight comes from a scaling analysis. 
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4 Proportional reasoning 



At roughly what n does the probability of a shared birthday rise above 50 percent? 

An image often helps me translate a problem into mathematics. Imagine 
everyone in the room greeting all the others by shaking hands, one hand- 
shake at a time, and checking for a shared birthday with each handshake. 

How many handshakes happen? 



Each person shakes hands with n — 1 others, making n(n — 1) arrows from 
one person to another. But a handshake is shared between two people. To 
avoid counting each handshake twice, we need to divide n (n — 1) by 2. Thus, 
there are n(n — l)/2 or roughly n 2 /2 handshakes. 



With 365 possible birthdays, the probability, per handshake, of a shared 
birthday is 1/365. With n 2 / 2 handshakes, the probability that no hand- 
shake joins two people with a shared birthday is approximately 





(4.57) 



To approximate this probability, take its natural logarithm: 



In 




n 2 / 2 





(4.58) 



Then approximate the logarithm using ln(l + x) 
1 \ 1 



x ( ror x « i j: 



In 1 - 



365 ^ 



365' 



(4.59) 



(You can test this useful approximation using a calculator, or see the picto- 
rial explanation in Street-Fighting Mathematics [33, Section 4.3].) With that 
approximation. 



In (probability of no shared birthday) 



n 2 1 
y X 365' 



(4.60) 



When the probability of no shared birthday falls below 0.5, the probability 
of a shared birthday rises above 0.5. Therefore, the condition for having 
enough people is 



In (probability of no shared birthday) < In ; 



(4.61) 



Because In (1/2) = — In 2, the condition simplifies to 



2x365 



> In 2. 



(4.62) 
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Using the approximation In 2 « 0.7, the threshold n is approximately 22.6: 

n > Vln2 x 2 x 365 « 22.6. (4.63) 

People do not come in fractions, so we need 23 people. Indeed, the exact 
calculation for n = 23 gives the probability of sharing a birthday as 0.507! 

From this scaling analysis, we can compactly explain what went wrong 
with the plausible conjecture. It assumed that the shared-birthday prob- 
ability p is proportional to the number of people n, reaching p = 0.5 when 
n = 0.5 x 365. The handshake picture, however, shows that the probability 
is related to the number of handshakes, which is proportional to n 2 . What 
a difference from a simple change in a scaling exponent! 

Problem 4.24 Three people sharing a birthday 

Extend our scaling analysis to the three-birthday problem: How many people must 
be in a room before the probability of three people sharing a birthday rises above 
0.5? (The results of the exact calculation, along with many approximations, are 
given by Diaconis and Mosteller [10].) 

Problem 4.25 Scaling of bubble sort 

The simplest algorithm for sorting — for example, to sort a list of n web pages ac- 
cording to relevance — is bubble sort. You go through the list in passes, comparing 
neighboring items and swapping any that are out of order. 

a. How many passes through the list do you need in order to guarantee that the 
list is sorted? 

b. The running time t (the time to sort the list) is proportional to the number of 
comparisons. What is the scaling exponent /3 in f oc U’? 

Problem 4.26 Scaling of merge sort 

Bubble sort (Problem 4.25) is easy to describe, but there is an alternative that is 
almost as easy: merge sort. It is a recursive, divide-and-conquer algorithm. You 
divide the list of n items into two equal parts (assume that n is a power of 2), and 
sort each half using merge sort. To make the sorted list, just merge the two sorted 
halves. 

a. Here is a list of eight randomly generated numbers: 98, 33, 34, 62, 31, 58, 61, 
and 15. Draw a tree illustrating how merge sort works on this list. Use two 
lines for each internal node, showing on one line the original list and on the 
second line the sorted list. 

b. The running time t is proportional to the number of comparison operations. 
What are the scaling exponents a and /3 in 

t oc «*(logn)^? 



( 4 . 64 ) 
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c. If you were a revenue agency and had to sort the tax records of all residents in 
a country, which algorithm would you use, merge sort or bubble sort? 

Problem 4.27 Scaling of standard multiplication 

In the usual school algorithm for multiplying n digit numbers, you find and add 
partial products. If the running time t is proportional to the number of single-digit 
multiplications, what is the scaling exponent /3 in t oc iP? 

Problem 4.28 Scaling of Karatsuba multiplication 

The Karatsuba algorithm for multiplying two //-digit numbers, discovered by Ana- 
toly Karatsuba in 1960 [28] and published in 1962 [29], was the first development 
in the theory of multiplication in centuries. Similarly to merge sort, you first break 
each number into two equal-length halves. For example, you split 2743 into 27 
and 43. Using Karatsuba multiplication recursively, you form three products from 
those halves, and combine the three products to get the original product. The 
subdividing stops when the numbers are short enough to be multiplied by the 
computer's hardware (typically, at 32 or 64 bits). 

The expensive step, repeated many times, is hardware multiplication, so the run- 
ning time t is proportional to the number of hardware multiplications. Find the 
scaling exponent /3 in t <x nP (You will find a scaling exponent that occurs rarely 
in physical scalings, namely an irrational number.) How does this /3 compare with 
the exponent in the school algorithm for multiplication (Problem 4.27)? 



4.5 Logarithmic scales in two dimensions 

Scaling relations, which are so helpful in understanding the physical and 
mathematical worlds, have a natural representation on logarithmic scales, 
the representation that we introduced in Section 3.1.3 whereby distances 
correspond to factors or ratios rather than to differences. 

As an example, here is the gravitational strength g at a 
distance r from the center of a planet. Outside the planet, 
g oc r~- (the inverse-square law of gravitation). On lin- 
ear axes, the graph of g versus r looks curved, like a hy- 
perbola. However, its exact shape is hard to identify; the 
graph does not make the relation between g and r obvi- 
ous. For example, let's try to represent our favorite scal- 
ing analysis: When r doubles, from, say, R Earlll to 2R Earth , 
the gravitational acceleration falls from y Earth to g Eart h/4- The graph shows 
thaty(2R) is smaller than g(R)' r unfortunately, the scaling is hard to extract 
from these points or from the curve. 
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However, using logarithmic scales for both g and r — called 
log-log axes — makes the relation clear. Let's try again to 
represent that when r doubles, the gravitational accelera- 
tion falls by a factor of 4. Call 1 unit on either logarithmic 
scale a factor of 2. The factor of 2 increase in r corresponds 
to moving 1 unit to the right. The factor of 4 decrease in g 
corresponds to moving 2 units downward. Therefore, the 
graph of g versus r is a straight line — and its slope is —2. 

On logarithmic scales, scaling relations turn into straight 
lines whose slope is the scaling exponent. 

Problem 4.29 Sketching gravitational field strength 

Imagine, as in Problem 4.13, a uniform spherical planet with radius R. Sketch, on 
log-log axes, gravitational field strength g versus r, the distance from the center 
of the planet. Include the regions r < R and r > R. 



Many natural processes, often for unclear reasons, obey 
scaling relations. A classic example is Zipf's law. In its 


word 


rank 


frequency 


simplest form, it states that the frequency of the kth most 


the 
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7.0% 


common word in a language is proportional to 1/k. The 


of 


2 


3.5 


table gives the frequencies of the three most frequent Eng- 
lish words. For English, Zipf's law holds reasonably well 
up to k ~ 1000. 


and 


3 


2.9 


Zipf's law is useful for estimation. For example, suppose 


that you 


have to 



estimate the government budget of Delaware, one of the smallest US states 
(Problem 4.30). A key step in the estimate is the population. Delaware 
might be the smallest state, at least in area, and it is probably nearly the 
smallest in population. To use Zipf's law, we need the data for the most pop- 
ulous state. That information I happen to remember (information about the 
biggest item is usually easier to remember than information about the small- 
est item): The most populous state is California, with roughly 40 million 
people. Because the United States has 50 states, Zipf's law predicts that the 
smallest state will have a population 1 /50th of California's, or roughly 1 mil- 
lion. This estimate is very close to Delaware's actual population: 917 000. 

Problem 4.30 Government budget of Delaware 

Based on the population of Delaware, estimate its annual government budget. Then 
look up the value and check your estimate. 
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4 Proportional reasoning 



Optimizing flight speed 

With our skills in proportional reasoning, let's return to the energy and 
power consumption in forward flight (Section 3.6.2). In that analysis, we 
took the flight speed as a given; based on the speed, we estimated the power 
required to generate lift. Now we can estimate the flight speed itself. We 
will do so by estimating the speed that minimizes energy consumption. 



Finding the optimum speed 



Flying requires generating lift and fighting drag. To 
find the optimum flight speed, we need to estimate 
the energy required by each process. The lift was the 
subject of Section 3.6.2, where we estimated the power 
required as 



lift 



0 mg Y 



r vL 2 ' 



(4.65) 
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where m is the plane's mass, v is its forward velocity, and L is its wingspan. 
The lift energy required to fly a distance d is this power multiplied by the 
travel time d/v: 



£|ift 




(mg ) 2 

Paii v2 L 2 • 



(4.66) 



In fighting drag, the energy consumed is the drag force times the distance. 
The drag force is 



^drag r> ^dPavP 2 



(4.67) 



where c d is the drag coefficient. To simplify comparing the energies re- 
quired for lift and drag, let's write the drag force as 

^drag 1-Pair^ ^ / (4.68) 



where C is a modified drag coefficient: It doesn't use the 1 /2 that is part 
of the usual combination c d /2, and it is measured relative to the squared 
wingspan L 2 rather than to the cross-sectional area A cs . 



For a 747, which drag coefficient, c d or C, is smaller? 
The drag force has two equivalent forms: 
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(4.69) 



► 



Thus, CL 2 = c d A cs /2. For a plane, the squared wingspan L 2 is much larger 
than the cross-sectional area, so C is much smaller than c d . 



With the new form for F d rag , the drag energy is 

E drag = Cp ail V 2 L 2 d, 

and the total energy required for flying is 
r ( mg ) 2 



-'total 



Fair V 2 L 2 



d + Cp ail v 2 L 2 d. 



(4.70) 



(4.71) 



G drag 




This formula looks intimidating because of the many para- 
meters such as m, g, L, and p air . As our interest is the flight 
speed v (in order to find the total energy), let's use propor- 
tional reasoning to reduce the energies to their essentials: 

E drag “ 1)2 and E lift °c v~ 2 . On log-log axes, each relation is 
a straight line with slope +2 for drag and slope —2 for lift. Because the re- 
lations have different slopes, corresponding to different scaling exponents, 
their graphs must intersect. 



To interpret the intersection, let's incorporate a sketch of the 
total energy E total = E dr ag+^ufe At low speeds, the dominant 
consumer of energy is lift because of its v~ 2 dependence, so 
the sketch of E total follows E lift . At high speeds, the dominant 
consumer of energy is drag because of its v 2 dependence, so 
the sketch of E total follows E drag . Between these extremes is 
an optimum speed that minimizes the energy consumption. 
(Because the axes are logarithmic, and log(fl + b) ^ log a + 
log b, the sum of the two straight lines is not straight!) This 
optimum speed, marked u specia i, is the speed at which E drag 
and E Hft cross. 

Why is the optimum right at the crossing speed, rather than faster 
or slower than the crossing speed? 




Symmetry! Try Problem 4.31 and then look afresh at Section 3.2.3, where 
we minimized E total without the benefit of log-log axes. 
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4 Proportional reasoning 



Flying faster or slower than the optimum speed means consuming more 
energy. That extra consumption cannot always be avoided. A plane is de- 
signed so that its cruising speed is its minimum-energy speed. At takeoff 
and landing, when it flies far below the minimum-energy speed, a plane 
must work harder to stay aloft, which is one reason that the engines are so 
loud at takeoff and landing (another reason is that the engine noise reflects 
off the ground and back to the plane). 

The optimization constraint, that a plane flies at the minimum-energy speed, 
allows us to eliminate v from the total energy. As we saw on the graph, at 
the minimum-energy speed, the drag and lift energies are equal: 

d ~ Cp ail v 2 L 2 d, (4.72) 

Fift ^drag 

or, solving for mg (and rejoicing that d canceled out), 

mg ~ 4Cp ail V Z L Z . (4.73) 



Now we could solve for v explicitly (and you get to find and use that so- 
lution in Problems 4.34 and 4.36). However, here we are interested in the 
total energy, not the flight speed itself. To find the energy without finding 
the speed, notice a reusable idea, an abstraction, within the right side of the 
equation for mg — which otherwise looks like a mess. 



Namely, the mess contains p ail v z L 2 , which is Fdrag /C. Therefore, when the 
plane is flying at the minimum-energy speed. 



mg ~ 4C 




(4.74) 



SO 

^drag ~ V Cmg . (4.75) 

Thanks to our abstraction and all the surrounding approximations, we learn 
a surprisingly simple relation between the drag force and the plane's weight, 
connected through the square root of our modified drag coefficient C. 

The energy consumed by drag — which, within a factor of 2, is also the total 
energy — is the drag force times the distance d. Therefore, the total energy 
is given by 

£ total ~ Edrag ~ JCmgd. 

^drag^ 



(4.76) 
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By using an optimum flight speed, we have eliminated the flight speed v 
from the total energy. 

Does the total energy depend in reasonable ways upon m, g, C, and d? 

Yes! First, lift overcomes the weight mg; therefore, the energy should, and 
does, increase with mg. Second, a streamlined plane (low C) should use 
less energy than a bluff, blocky plane (high C); the energy should, and does, 
increase with the modified drag coefficient C. Finally, because the plane is 
flying at a constant speed, the energy should be, and is, proportional to the 
travel distance d. 



4.6.2 Flight range 



From the total energy, we can estimate the range of the 747 — the distance 
that it can fly on a full tank of fuel. The energy is 40mgd, so the range d is 



d 



Efuel 

40 mg' 



(4.77) 



where E fuel is the energy in the full tank of fuel. To estimate d, we need to 
estimate E fuel , the modified drag coefficient C, and maybe also the plane's 
mass m . 



► 



How much energy is in a full tank of fuel? 



The fuel energy is the fuel mass times its energy density (as an energy per 
mass). Let's describe the fuel mass relative to the plane's mass m, as a frac- 
tion f> of the plane's mass: m fuel = f m. Using E fuc | for the energy density of 
fuel, 

Efuel = jSmfifue!. (4.78) 



When we put this expression into the range d, the plane's mass m in E fuel , 
which is the numerator of d, cancels the m in the denominator 40 mg . The 
range then simplifies to a relation independent of mass: 



Efuel 
40 g ' 



(4.79) 



For the fuel fraction f>, a reasonable guess for long-range flight is f> ~ 0.4: a 
large portion of the payload is fuel. For the energy density T lut ,|, we learned 
in Section 2.1 that <S fuel is roughly 4 x 10 7 joules per kilogram (9 kilocalories 
per gram). This energy density is what a perfect engine would extract. At 
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the usual engine efficiency of one-fourth, £ fue] becomes roughly 10 7 joules 
per kilogram. 

How do we find the modified drag coefficient C? 

This estimate is the trickiest part of the range estimate, because C needs to 
be converted from more readily available data. According to Boeing, the 
manufacturer of the 747, a 747 has a drag coefficient of C' ~ 0.022. As the 
prime symbol indicates, C' is yet another drag coefficient. It is measured 
using the wing area A wing and uses the traditional 1/2: 

^drag ^ '^'WingPair® • (4.80) 

So we have three drag coefficients, depending on whether the coefficient is 
referenced to the cross-sectional area A cs (the usual definition of c d ), to the 
wing area A wing (Boeing's definition of C'), or to the squared wingspan L 2 
(our definition of C, which also lacks the factor of 1 / 2 that the others have). 

To convert between these definitions, compare the three ways to express the 
drag force: 





\ 

-C'A wtag (Boeing's definition). 








^"drag Pair ^ * 


CL 2 (our definition). 




(4.81) 




1 

-c d A cs (the usual definition). 








From this comparison. 








—C A — CT 2 

2 ^ 1 x wing K -' L -‘ 


2 C d^cs- 




(4.82) 


The conversion from C to C is therefore 
A 

r l r , ZT wing 

2 L 2 ' 




(4.83) 


Using / for the wing's front-to-back length, the wing 
area becomes A wing = LI. Then the area ratio A wing /L 2 
simplifies to l/L (the reciprocal of the wing aspect ra- 


wing 


1 

T 

i 

i 




tio), and the drag coefficient C simplifies to 


body 


L 




C = -C- 
2 V 


(4.84) 


wing 


1 

1 

± 





For a 747, l « 10 meters, L « 60 meters, and C ~ 0.022, 
so C w 1/600: 
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C ~ \ x 0.022 x 

2 60 m 600 

The resulting range is roughly 10 000 kilometers: 



P £(uel 

0.4 x 10 7 Jkg -1 
,/ 1/600 x 10ms -2 

~ O 



10 7 m = 10 4 km. 



(4.85) 



(4.86) 



The actual maximum range of a 747-400 is 13450 kilometers (a figure we 
used in Section 4.3.2): Our approximate analysis is amazingly accurate. 



Problem 4.31 Graphical interpretation of minimization using symmetry 

In Section 3.2.3, we started with the general form for the total energy 

E ~ — + Bv 2 (4.87) 

V 1 

hift ^drag 



and found the optimum (minimum-energy) speed by constructing the following 
symmetry operation: 



^ JA/B 



(4.88) 



V 

On the log-log plot of the energies versus v, what is the geometric interpretation 
of this symmetry operation? 



Problem 4.32 Minimum-power speed 

We estimated the flight speed v E that minimizes energy consumption. We could 
also have estimated v P , the speed that minimizes power consumption. What is the 
ratio i’p/i’e? Before doing an exact calculation, sketch P lift , P dra g, and P lift + P dra g 
(on log-log axes) and place i’p on the correct side of i> E . Then check your placement 
against the result of the exact calculation. 



Problem 4.33 Coefficient of lift 

Just as we can define a dimensionless drag coefficient c d , where 
1 , , 

^"drag — 2 C dPaird^ J , (4.89) 

we can define a dimensionless lift coefficient c L , where 
1 

Tlift = 2 C LPairdP 2 , (4.90) 

where the area A here is usually the wing area. (The same formula structure shows 
up in drag and lift — abstraction!) Estimate c L for a 747 in cruising flight. 
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4.6.3 Flight range versus size 

In proportional reasoning, we ask, "How does one quantity change when 
we change an independent variable (for example, if we double an indepen- 
dent variable)?" Because flying objects come in such a wide range of sizes, 
a natural independent variable is size. 

How does the flight range depend on the plane's size? 



Let's assume that all planes are geometrically similar — that is, that they 
have the same shape and differ only in their size. Now let's see how chang- 
ing the plane's size changes the quantities in the range 



~ 4c g - 



( 4 . 91 ) 



The gravitational acceleration g remains fixed. The fuel energy density £ fuel 
and the fuel fraction f also remain fixed — which shows again the benefit of 
using intensive quantities, as we discussed in Section 4.2.4 when we made 
a scaling relation for planetary surface gravity. Finally, the drag coefficient 
C depends only on the plane's shape (on how streamlined it is), not on its 
size, so it too remains constant. Therefore, the flight range is independent 
of the plane's size! 



A further surprise comes from comparing the range of planes with the 
range of migrating birds. The proportionality d ex fi£ fuei / fC is shorthand 
for the comparison 



^ plane 



*bird 



a pfuel 

Pplane plane 

^bird fibdKl 



c 



plane 



Cus 



bird 



- 1/2 



( 4 . 92 ) 



The ratio of fuel fractions is roughly 1: For the plane, f> ~ 0.4; and a bird, 
having eaten all summer, is perhaps 40 percent fat (the bird's fuel). The 
energy density in jet fuel and fat is similar, as is the efficiency of engines 
and animal metabolism (at about 25 percent). Therefore, the ratio of energy 
densities is also roughly 1. Finally, a bird has a similar shape to a plane, 
so the ratio of drag coefficients is also roughly 1. Therefore, planes and 
well-fattened migrating birds should have a similar maximum range, about 
10 000 kilometers. 



Let's check. The longest known nonstop flight by an animal is 11 680 kilo- 
meters: made by a bar-tailed godwit tracked by satellite by Robert Gill and 
his colleagues [19] as the bird flew nonstop from Alaska to New Zealand! 
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4.7 Summary and further problems 

Proportional reasoning focuses our attention on how one quantity deter- 
mines another. (A wonderful collection of pointers to further reading is 
the American Journal of Physics's "Resource Letter" on scaling laws [49].) By 
guiding us toward what is often the most important characteristic of a prob- 
lem, the scaling exponent, it helps us discard spurious complexity. 

Problem 4.34 Cruising speed versus mass 

For geometrically similar animals (the same shape and composition but different 
sizes) in forward flight, how does the animal's minimum-energy flight speed v 
depend on its mass ml In other words, what is the scaling exponent /3 in v oc m^l 

Problem 4.35 Hovering power versus size 

In Section 3.6.1, we derived the power required to hover. For geometrically simi- 
lar birds, how does the power per mass depend on the animal's size LI In other 
words, what is the scaling exponent y in Phover/ m x h 7 ? Why are there no large 
hummingbirds? 

Problem 4.36 Cruising speed versus air density 

How does a plane's (or a bird's) minimum-energy speed v depend on p air ? In other 
words, what is the scaling exponent y in v oc pf r l 

Problem 4.37 Speed of a bar -tailed godwit 

Use the results of Problem 4.34 and Problem 4.36 to write the ratio z’ 747 A’godwit as 
a product of dimensionless factors, where v 7i7 is the minimum-energy (cruising) 
speed of a 747, and Ug 0t j W it is the minimum-energy (cruising) speed of a bar-tailed 
godwit. Using Mg 0 dwit ~ 400 grams, estimate the cruising speed of a bar-tailed 
godwit. Compare your result to the average speed of the record-setting bar-tailed 
godwit that was studied by Robert Gill and his colleagues [19], which made its 
11 680-kilometer journey in 8.1 days. 

Problem 4.38 Thermal resistance of a house versus a tea mug 

When we developed the analogy between low-pass electrical and thermal filters 
(Section 2.4.5) — whether RC circuits, tea mugs, or houses — we introduced the ab- 
straction of thermal resistance ^thermal- hi this problem, you estimate the ratio of 
thermal resistances dermal/ 

House walls are thicker than teacup walls. Because thermal resistance, like electri- 
cal resistance, is proportional to the length of the resistor, the house's thicker walls 
raise its thermal resistance. However, the house's larger surface area, like having 
many resistors in parallel, lowers the house's thermal resistance. Estimate the size 
of these two effects and thus the ratio of the two thermal resistances. 
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Problem 4.39 General birthday problem 

Extend the analysis of Problem 4.24 to k people sharing a birthday. Then compare 
your predictions to the exact results given by Diaconis and Mosteller in [10]. 

Problem 4.40 Flight of the housefly 

Estimate the mechanical power required for a common housefly Musca domestica 
(m ~ 12 milligrams) to hover. From everyday experience, estimate its typical flight 
speed. At this flight speed, compare the power requirements for forward flight and 
for hovering. 
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In 1906, Los Angeles received 540 millimeters of precipitation (rain, snow, 
sleet, and hail). 

Is this rainfall large or small? 

On the one hand, 540 is a large number, so the rainfall is large. On the other 
hand, the rainfall is also 0.00054 kilometers, and 0.00054 is a tiny number, 
so the rainfall is small. These arguments contradict each other, so at least 
one must be wrong. Here, both are nonsense. 

A valid argument comes from a meaningful comparison — for example, com- 
paring 540 millimeters per year with worldwide average rainfall — which 
we estimated in Section 3.4.3 as 1 meter per year. In comparison to this rain- 
fall, Los Angeles in 1906 was dry. Another meaningful comparison is with 
the average rainfall in Los Angeles, which is roughly 350 millimeters per 
year. In comparison, 1906 was a wet year in Los Angeles. 

In the nonsense arguments, changing the units of length changed the re- 
sult of the comparison. In contrast, the meaningful comparisons are inde- 
pendent of the system of units: No matter what units we select for length 
and time, the ratio of rainfalls does not change. In the language of symme- 
try, which we met in Chapter 3, changing units is the symmetry operation. 
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and meaningful comparisons are the invariants. They are invariant because 
they have no dimensions. When there is change, look for what does not 
change: Make only dimensionless comparisons. 

This criterion is necessary for avoiding nonsense; however, it is not suffi- 
cient. To illustrate the difficulty, let's compare rainfall with the orbital speed 
of the Earth. Both quantities have dimensions of speed, so their ratio is in- 
variant under a change of units. However, judging the wetness or dryness 
of Los Angeles by comparing its rainfall to the Earth's orbital speed pro- 
duces nonsense. 

Here is the moral of the preceding comparisons. A quantity with dimen- 
sions is, by itself, meaningless. It acquires meaning only when compared 
with a relevant quantity that has the same dimensions. This principle un- 
derlies our next tool: dimensional analysis. 

Problem 5.1 Book boxes are heavy 

In Problem 1.1, you estimated the mass of a small moving-box packed with books. 
Modify your calculation to use a medium moving-box, with a volume of roughly 
0.1 cubic meters. Can you think of a (meaningful!) comparison to convince some- 
one that the resulting mass is large? 

Problem 5.2 Making energy consumption meaningful 

The United States' annual energy consumption is roughly 10 20 joules. Suggest two 
comparisons to make this quantity meaningful. (Look up any quantities that you 
need to make the estimate, except the energy consumption itself!) 

Problem 5.3 Making solar power meaningful 

In Problem 3.31, you should have found that the solar power falling on Earth is 
roughly 10 17 watts. Suggest a comparison to make this quantity meaningful. 

Problem 5.4 Energy consumption by the brain 

The human brain consumes about 20 watts, and has a mass of 1-2 kilograms. 

a. Make the power more meaningful by estimating the brain's fraction of the 
body's power consumption: 

brain power 
basal metabolism ’ 

b. Make this fraction even more meaningful by estimating the ratio 

the brain's fraction of the body's power consumption 
the brain's fraction of the body's mass 





5.1 Dimensionless groups 



139 



Problem 5.5 Making oil imports meaningful 

In Section 1.4, we estimated that the United States imports roughly 3 x 10 9 barrels 
of oil per year. This quantity needs a comparison to make it meaningful. As one 
possibility, estimate the ratio 

cost of the imported oil 

. ( 5 . 3 ) 

US military spending to "defend" oil-rich regions 

If this ratio is less than 1, suggest why the US government does not cancel that 
part of the military budget and use the savings to provide US consumers with free 
imported oil. 

Problem 5.6 Making the energy in a 9-volt battery meaningful 

Using your estimate in Problem 1.11 for the energy in a 9-volt battery, estimate 
energy content of the battery 
cost of the battery 

Compare that quotient to the same quotient for electricity from the wall socket. 



5.1 Dimensionless groups 

Because dimensionless quantities are the only meaningful quantities, we 
can understand the world better by describing it in terms of dimensionless 
quantities. But we need to find them. 



5.1.1 Finding dimensionless groups 



To illustrate finding these dimensionless quantities, let's try 
an example that uses familiar physics: When learning a new 
idea, it is helpful to try it on a familiar example. We'll find a 
train's inward acceleration as it moves on a curved track. The 
larger the acceleration, the more the track or the train needs 
to tilt so that the passengers do not feel uncomfortable and (if 
the track is not tilted enough) the train does not tip over. 



V 




Our goal is the relation between the train's acceleration a, its 

speed v, and the track's radius of curvature r. In our state of knowledge 

now, the relation could be almost anything. Here are a few possibilities: 

a + v 2 v 3 r + a 2 a 2 v a + v , , 

/ j / o 9 • (5.5) 

r a v v + r ra + v 3 r 2 



Although those possibilities are bogus, among the vast sea of possible rela- 
tions, one relation is correct. 
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To find the constraint that will shrink the sea to a few drops, first defocus 
our eyes and see all the choices as examples of the general form 




some function of another function of 



a, v, and r a, v, and r 

Even though the blobs might be complicated functions, they must have 
identical dimensions. By dividing both sides by blob B, we get a simpler 
form: 



blob A 




Now each side is dimensionless. Therefore, whatever the relation between 
a, v, and r, we can write it in dimensionless form. 



The preceding process is not limited to this problem. In any valid equa- 
tion, all the terms have identical dimensions. For example, here is the total 
energy in a spring-mass system: 



^ total = \™ 2 + ^kx 2 . 



(5.8) 



The kinetic-energy term (mv 2 / 2) and the potential-energy term (kx 2 / 2) have 
the same dimensions (of energy). The same process of dividing by one of 
the terms turns any equation into a dimensionless equation. As a result, any 
equation can be written in dimensionless form. Because the dimensionless 
forms are a small fraction of all possible forms, this restriction offers us a 
huge reduction in complexity. 



To benefit from this reduction, we must compactly describe all such forms: 
Any dimensionless form can be built from dimensionless groups. A dimen- 
sionless group is the product of the quantities, each raised to a power, such 
that the product has no dimensions. In our example, where the quantities 
are a, v, and r, any dimensionless group G has the form 



G = a x v y r z , 



(5.9) 
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where G has no dimensions and where the exponents x, y, and z are real 
numbers (possibly negative or zero). 

Because any equation describing the world can be written in dimensionless 
form, and because any dimensionless form can be written using dimension- 
less groups, any equation describing the world can be written using dimen- 
sionless groups. 

That news is welcome, but how do we find these groups? 

The first step is to tabulate the quantities with their descrip- a LT -2 acceleration 
tions and dimensions. By convention, capital letters are v LT -i S p ee q 

used to represent dimensions. For now, the possible di- L radius 

mensions are length (L), mass (M), and time (T). Then, for 

example, the dimensions of v are length per time or, more compactly, LT -1 . 

Then, by staring at the table, we find all possible dimensionless groups. A 
dimensionless group contains no length, mass, or time dimensions. For our 
example, let's start by getting rid of the time dimension. It occurs in a as 
T~~ and in v as T -1 . Therefore, any dimensionless group must contain a/v 2 . 

This quotient has dimensions of L -1 . To make it dimensionless, multiply 
it by the only quantity that is purely a length, which is the radius r. The 
result, ar/v 2 , is a dimensionless group. In the a x v y r z form, it is a 1 v~ 2 r 1 . 

Are there other dimensionless groups? 

To get rid of time, we started with a/v 2 and then ended, inevitably, with 
the group ar/v 2 . To make another dimensionless group, we would have to 
choose another starting point. However, the only starting points that get rid 
of time are powers of a/v 2 — for example, v 2 / a or a 2 /v 2 — and those choices 
lead to the corresponding power of ar/v 2 . Therefore, any dimensionless 
group can be formed from ar/v 2 . Our three quantities a, r, and v produce 
exactly one independent dimensionless group. 

As a result, any statement about the circular acceleration can be written 
using only ar/v 2 . All dimensionless statements using only ar/v 2 have the 
general form 
UT 

— = dimensionless constant, (5.10) 

v l 

because there are no other independent dimensionless groups to use on the 
right side. 
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Why can't we use ar/v 2 on the right side? 

We can, but it doesn't create new possibilities. As an example, let's try the 
following dimensionless form: 




Its solution is ar/v 2 = 1 /2, which is another example of our general form 
UT 

— = dimensionless constant. (5.12) 

v l 

But what if we use a more complicated function? 



Let's try one: 




Its solutions are 
ar_ _ f <p, 
v 2 “ l-l !<p, 



(5.13) 



(5.14) 



where <p is the golden ratio (1.618...). Deciding between these solutions 
would require additional information or requirements, such as the sign con- 
vention for the acceleration. Even so, each solution is another example of 
the general form 
CIV 

— = dimensionless constant. (5.15) 

v z 



Using that form, which we cannot escape, the acceleration of the train is 
v 2 

a , (5.16) 

r 



where the ~ contains the (unknown) dimensionless constant. In this case, 
the dimensionless constant is 1. However, dimensional analysis, as this 
procedure is called, does not tell us its value — which would come from a 
calculus analysis or, approximately, from a lumping analysis (Section 6.3.4). 

Using a ~ v 2 /r, we can now estimate the inward acceleration of the train 
going around a curve. Imagine a moderately high-speed train traveling at 
v ~ 60 meters per second (approximately 220 kilometers or 135 miles per 
hour). At such speeds, railway engineers specify that the track's radius of 
curvature be at least 2 or 3 kilometers. Using the smaller radius of curvature, 
the inward acceleration becomes 
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(60ms 1 ) 2 
2 x 10 3 m 



1.8ms 2 . 



(5.17) 



Because no quantity with dimensions is large or small on its own, 
this acceleration by itself is meaningless. It acquires meaning in 
comparison with a relevant acceleration: the gravitational accel- 
eration g. For this train, the dimensionless ratio n/g is approx- 
imately 0.18. This ratio is also tan f), where 6 is the train's tilt 
that would make the passengers feel a net force perpendicular to 
the floor. With n/g « 0.18, the comfortable tilt angle is approx- 
imately 10°. Indeed, tilting trains can tilt up to 8°. (This range 
is usually sufficient, as a full tilt is disconcerting: One would see 
a tilted ground but would still feel gravity acting as it normally 
does, along one's body axis.) 

Using our formula for circular acceleration, we can also es- 
timate the maximum walking speed. In walking, one foot 
is always in contact with the ground. As a rough model of '' 
walking, the whole body, represented as a point mass at its 
center of mass (CM), pivots around the foot in contact with leg 1 

the ground — as if the body were an inverted pendulum. If 
you walk at speed v and have leg length Z, then the result- 
ing circular acceleration (the acceleration toward the foot) is 
v 2 /l. When you walk fast enough such that this acceleration is more than g, 
gravity cannot supply enough acceleration. 

This change happens when v ~ Jgl. Then your foot leaves the ground, and 
the walk turns into a run. Therefore, gait is determined by the dimension- 
less ratio v 2 /gl. This ratio, which also determines the speed of water waves 
(Problem 5.15) and of ships (Problem 5.64), is called the Froude number 
and abbreviated Fr. 




Here are three ways to check whether our pendulum model of walking is 
reasonable. First, the resulting formula, 

Vx ~ VF (s- 18 ) 

explains why tall people, with a longer leg length, generally walk faster 
than short people. 

Second, it predicts a reasonable maximum walking speed. With a leg length 
of l ~ 1 meter, the limit is 3 meters per second or about 7 miles per hour: 

Wmx ~ VlOms -2 x lm « 3 ms -1 . (5.19) 
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This prediction is consistent with world-record racewalking speeds, where 
the back toe may not leave the ground until the front heel has touched the 
ground. The world records for 20-kilometer racewalking are lh:24m:50s 
for women and lh:17m:16s for men. The corresponding speeds are 3.9 and 
4.3 meters per second. 

The third test is based on gait. In a fascinating experiment, Rodger Kram 
and colleagues [31] reduced the effective gravity g. This reduction changed 
the speed of the walking-running transition, but the speed still satisfied 
v 2 /gl ~ 0.5. The universe cares only about dimensionless quantities! 

Another moral of this introduction to dimensional analysis is that 
every dimensionless group is an abstraction. Here, the group 
ar/v 2 tells us that the universe cares about a, r, or v through 
the combination ar/v 2 . This relation is described by the tree dia- 
gram (the edge label of —2 is the exponent of v mv~ 2 ). Because 
there is only one independent dimensionless group, the universe 
cares only about ar/v 2 . Therefore, the universe cares about a, r, and v only 
through the combination ar/v 2 . This freedom not to worry about individ- 
ual quantities simplifies our picture of the world. 

5.1.2 Counting dimensionless groups 

Finding the circular acceleration required finding all possible dimension- 
less groups and showing that all groups could be constructed from one 
group — say, ar/v 2 . That reasoning followed a chain of constraints: To get 
rid of the time, the dimensionless group had to contain the quotient a/v 2 ; 
to get rid of the length, the quotient needed to be multiplied by the correct 
power of r. 

For each problem, do we have to construct a similar chain of reasoning in order to 
count the dimensionless groups? 

Following the constraints is useful for finding dimensionless groups, but 
there is a shortcut for counting independent dimensionless groups. The 
number of independent groups is, roughly, the number of quantities minus 
the number of dimensions. A more precise statement will come later, but 
this version is enough to get us started. 

Let's test it using the acceleration example. There are three quantities: a, v, 
and r. There are two dimensions: length (L) and time (T). There should be, 
and there is, one independent dimensionless group. 
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Let's also test the shortcut on a familiar physics formula: w MLT -2 
W = mg, where W is an object's weight, rn is its mass, and g m M 
is the gravitational acceleration. There are three quantities g LT~ 2 
(W, m, and g) made from three dimensions (M, L, and T). 

Our shortcut predicts no dimensionless groups at all. However, W /mg is 
dimensionless, which refutes the prediction! 

What went wrong? 

Although the three quantities seem to contain three dimensions, the three 
dimensional combinations actually used — force (MLT -2 ), mass (M), and 
acceleration (LT -2 ) — can be constructed from just two dimensions. For ex- 
ample, we can construct them from mass and acceleration. 

But acceleration is not a fundamental dimension, so how can we use it? 

The notion of fundamental dimensions is a human convention, part of our 
system of measurement. Dimensional analysis, however, is a mathematical 
process. It cares neither about the universe nor about our conventions for 
describing the universe. That lack of care may appear heartless and seem 
like a disadvantage. However, it means that dimensional analysis is inde- 
pendent of our arbitrary choices, giving it power and generality. 

As far as dimensional analysis is concerned, we can choose w Mx[«] 
any set of dimensions as our fundamental dimensions. The m M 
only requirement is that they suffice to describe the dimen- g [a] 
sions of our quantities. Here, mass and acceleration suffice, 
as the rewritten dimensions table shows: Using the notation \a \ for "the 
dimensions of a," the dimensions of W are M x [a] and the dimensions of 
g are just [a]. 

In summary, the three quantities contain only two independent dimensions. 
Three quantities minus two independent dimensions produce one indepen- 
dent dimensionless group. Accordingly, the revised counting shortcut is 

number of quantities 

— number of independent dimensions (5.20) 

number of independent dimensionless groups 

This shortcut, known as the Buckingham Pi theorem [4], is named after 
Edgar Buckingham and his uppercase Pi (n) notation for dimensionless 
groups. (It is also the rank-nullity theorem of linear algebra [42].) 



weight 

mass 

gravity 



weight 

mass 

gravity 
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Problem 5.7 Bounding the number of independent dimensionless groups 

Why is the number of independent dimensionless groups never more than the 
number of quantities? 

Problem 5.8 Counting dimensionless groups 

How many independent dimensionless groups do the following sets of quantities 
produce? 

a. period T of a spring-mass system in a gravitational field: T, k (spring constant), 
m, r 0 (amplitude), and g. 

b. impact speed v of a free-falling object: v, g, and h (initial drop height). 

c. impact speed v of a downward- thrown free-falling object: v, g, h, and (launch 
velocity). 

Problem 5.9 Using angular frequency instead of speed 

Redo the dimensional-analysis derivation of circular acceleration using the radius 
r and the angular frequency cv as independent variables. Using a = v 2 /r, find the 
dimensionless constant in the general dimensionless form 

dimensionless group proportional to a = dimensionless constant. (5.21) 

Problem 5.10 Impact speed of a dropped object 

Use dimensional analysis to estimate the impact speed of a freely falling rock that 
is dropped from a height h. 

Problem 5.11 Speed of gravity waves on deep water 

In this problem, you use dimensional analysis to 
find the speed of waves on the open ocean. These 
waves are the ones you would see from an airplane 
and are driven by gravity. Their speed could de- 
pend on gravity g, their angular frequency a\ and 
the density of water p. Do the analysis as follows. 

a. Explain why these quantities produce one independent dimensionless group. 

b. What is the group proportional to v? 

c. With the further information that the dimensionless constant is 1, predict the 
speed of waves with a period of 17 seconds (you can measure the period by 
timing the interval between each wave's arrival at the shore). This speed is 
also the speed of the winds that produced the waves. Is it a reasonable wind 
speed? 

d. What would the dimensionless constant be if, in the table of quantities, angular 
frequency u> is replaced by the period T? 
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Problem 5.12 Using period instead of speed 

In finding a formula for circular acceleration (Section 5.1.1), our independent vari- 
ables were the radius r and the speed v. Redo the dimensional-analysis derivation 
using the radius r and the period T as the independent variables. 

a. Explain why there is still only one independent dimensionless group. 

b. What is the independent dimensionless group proportional to a? 

c. What dimensionless constant is this group equal to? 



5.2 One dimensionless group 

The most frequent use of dimensional analysis involves one independent 
dimensionless group — for example, nr /v 2 in our analysis of circular accel- 
eration (Section 5.1.1). Let's look at this kind of case more closely, which 
has lessons for more complicated problems. 



5.2.1 Universal constants 

What we will find is that dimensional analysis reduces complexity by 
generating results with wide application. That is, in the general form 

independent dimensionless group = dimensionless constant, (5.22) 

the dimensionless constant is universal. Let's see what universal means 
through an example — the analysis of a small-amplitude pendulum. So, i 
imagine releasing a pendulum from a small angle 9 0 . 

What is its period of oscillation? 




The first step in dimensional analysis is to list the relevant 
quantities. The list begins with the goal quantity — here, 
the period T. It depends on gravity g, the string length 
Z, perhaps the mass m of the bob, and perhaps the ampli- 
tude 6q. Fortunately, when the amplitude is small, as it is 
here, then the amplitude turns out not to matter, so we won't list it. 

What are the dimensionless groups? 
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period 
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M 


mass of bob 



These four quantities contain three independent dimensions (M, L, and T). 
By the Buckingham Pi theorem (Section 5.1.2), they produce one indepen- 
dent dimensionless group. This group cannot contain m, because m is the 
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only quantity with dimensions of mass. A simple dimensionless group is 
gT 2 /l. However, because our eventual goal is the period T, let's make a 
dimensionless group proportional to T itself rather than to T 2 . This group 
is Tjg/l . Then the most general dimensionless statement is 




( 5 . 23 ) 



where C is a dimensionless constant. Even though its value is, for the mo- 
ment, unknown, it is universal. The same constant applies to a pendulum 
with a shorter string or, perhaps more surprisingly, to a pendulum on Mars, 
with its different gravitational strength. If we find the constant for one pen- 
dulum on one planet, we know it for all pendulums on all planets. 



There are at least three ways to find C. The first is to solve the pendulum 
differential equation — which is hard work. The second approach is to solve 
a cleverly designed simpler problem (Problem 3.4). Although the approach 
is clever, it is not as general as the third method. 



The third method is to measure C with a home experiment! For that 
purpose, I turned my key ring into a pendulum by hanging it from a 
string. The string was roughly twice the length of American letter paper 
(thus, 2x11 inches) and the period was roughly 1.5 seconds. Therefore, 

r v l~S~ i r / 32 ft s — 2 , 

c = T v y ~ L5s x v 2 ft = 6 ~ 5,24 



A 



i 



O - *- 

key ring 



► 



Would the constant be different in a modern system of units? 



In metric units, g ~ 9.8 meters per second per second, / « 0.6 meters, and 
C is, within calculation inaccuracies, still 6: 

C « 1 . 5 s x « 6. (5.25) 

V 0.6 m 

The system of units does not matter — which is the reason for using dimen- 
sionless groups: They are invariant under a change of units. Even so, an 
explicit 6 probably would not appear if we solved the pendulum differen- 
tial equation honestly. But a more precise measurement of C might suggest 
a closed form for this dimensionless constant. 

The pendulum length, from the knot where I hold it to the center of the key 
ring is 0.65 meters (slightly longer than the rough estimate of 0.6 meters). 
Meanwhile, 10 periods took 15.97 seconds. Then C is close to 6.20: 
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C « 1.597 s x 



9.8 ms 



-2 



6 . 20 . 



0.65 m 

What dimensionless numbers could produce 6.20? 



(5.26) 



This value is close to 2n, which is approximately 6.28. That guess feels 
like a leap, but have courage. The guess is plausible once we remember 
that pendulums oscillate and oscillations often involve 2n, or that we are 
asking for the period rather than the angular frequency, a choice that often 
introduces a 2n (as in Problem 5.11(d)). The resulting period T is 



T = 




(5.27) 



(For a physical explanation of the 2n, try Problem 3.4.) 

This example shows how dimensional analysis, a mathematical approach, 
sits between two physical approaches. First we used our physical knowl- 
edge to list the relevant quantities. Then we reduced the space of possi- 
ble relations by using dimensional analysis. Finally, to find the universal 
constant C, which dimensional analysis could not tell us, we again used 
physical knowledge (a home experiment). 

Problem 5.13 Your own measurement 

Make your own pendulum and measure the universal constant Tjg/1 . 

Problem 5.14 Period of a spring-mass system 

Use dimensional analysis to find, except for a dimensionless constant, the period 
T of a spring-mass system with spring constant k, mass m, and amplitude x 0 . Find 
the dimensionless constant in the most general dimensionless statement 

group proportional to T — dimensionless constant. (5.28) 

Problem 5.15 Speed of waves in shallow water 

In shallow water, where the wavelength is much larger than the depth, waves dri- 
ven by gravity travel at a speed that could depend on gravity g, the water depth h, 
and the density of water p. 

a. Find the independent dimensionless group proportional to v 2 , where v is the 
wave speed. Compare this group to the Froude number, the dimensionless 
ratio that we introduced to study walking (Section 5.1.1). 

b. What therefore is the scaling exponent /3 in the scaling 
relation v oc Test your prediction by measuring 
z'(lcm) and z>(4cm). To make the measurement, fill 
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a baking dish with water and excite a sloshing wave by slightly lifting and 
quickly setting down one end of the dish. 

c. Using your data, estimate the universal (dimensionless) constant in the relation 

dimensionless group from part (a) = dimensionless constant. (5.29) 

d. Predict the speed of tidal waves, which are (shallow-water!) waves created by 
underwater earthquakes. How long does a tidal wave take to cross an ocean? 



5.2.2 Atomic blast energy 



Problems with only one dimensionless group need not be simple or 
easy to by other methods. A classic example is finding the energy re- 
leased by the first atomic bomb, detonated in the New Mexico desert 
in 1945. The blast energy, or yield, was top secret. Yet declassified 
photographs of the blast, because they had a scale bar, provided data 
on the radius of the fireball at several times after the explosion. The 
data must have seemed innocuous enough to release. 



t (ms) 


R( m) 


3.26 


59.0 


4.61 


67.3 


15.0 


106.5 


62.0 


185.0 



But from these data, G. I. Taylor of Cambridge University pre- 
dicted the yield [44]. The analysis, like many calculations in 
fluid mechanics, is long and complicated. We'll instead use 
dimensional analysis to find the relation between the blast ra- 
dius R, the time t since the blast, the blast energy E, and the air density p. 
Then we'll use the R(t) data to predict E. 




These four quantities contain three independent di- 
mensions. Thus, they produce one independent di- 
mensionless group. To find it, let's eliminate one 
dimension at a time. Here, a bit of luck simplifies 
the process, because we have dimensions that are 
easy to eliminate. 
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blast energy 
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L 


blast radius 
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time since blast 


P air 
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air density 



► 



Do any dimensions occur in only one or two quantities? 



Yes: Mass occurs in E as M 1 and in p air also as M 1 ; time appears in E as T _ ~ 
and in t as T 1 . Therefore, to eliminate mass, the dimensionless group must 
contain E/p ail . To eliminate time, the dimensionless group must contain Et 2 . 
Thus, Et 2 /p aiv eliminates mass and time. Because it has dimensions of L 5 , 
the only way to remove these dimensions without introducing any powers 
of time or mass is to divide by R 5 . The result, Et 2 /p air R 5 , is an independent 
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dimensionless group. It is also the only dimensionless group proportional 
to our goal, the blast energy E. 

With only one dimensionless group, the most general dimensionless state- 
ment about the blast energy is 




For a particular explosion, E is fixed (although unknown), as is p ail . There- 
fore, these quantities drop out of the corresponding scaling relation, which 
becomes t 2 oc R 5 or R oc f 2/5 . On log— log axes, the data on the blast radius 
should fall along a line of slope 2/5 — as they almost exactly do: 



185 | 
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67.3 h 
59 L 
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3.26 4.61 



0.4 slope 



15 
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62 



With the dimensional analysis result Et 2 /p ail R 5 ~ 1, each point predicts a 
blast energy according to E ~ p ail R 5 /t 2 . Using 1 kilogram per cubic meter 
for p air , the E estimates lie between 5.6 and 6.7 x 1 0 1 ’ joules. Unfortunately 
for judging the accuracy of the estimate, joules are an unfamiliar unit for 
the energy of a bomb blast. The familiar unit is tons of TNT. 



What is the predicted blast energy as a mass of TNT? 



One gram of TNT releases 1 kilocalorie or roughly 4 kilojoules. It con- 
tains only one-fourth the energy density of sugar, but the energy is released 
much more rapidly! One kiloton is 10 9 grams, which release 4 x 10 12 joules. 
Our predicted yield of 6 x 10 13 joules is thus roughly 15 kilotons of TNT. 

Just for fun, if we reestimate the yield using a more accurate air density of 
1.2 kilograms per cubic meter, the blast energy would be 18 kilotons. This 
result is shockingly accurate considering the number of approximations it 
contains: The classified yield was 20 kilotons. (The missing universal con- 
stant for shock- wave explosions must be very close to 1.) 

Dimensional analysis is powerful! 
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5.3 More dimensionless groups 

The biggest change in dimensional analysis comes with a second indepen- 
dent dimensionless group. With only one dimensionless group, the most 
general statement that the universe could make was 

dimensionless group ~ 1. (5.31) 

With a second group, the universe gains freedom on the right side: 

group 1 = /(group 2), (5.32) 

where / is a dimensionless function: It takes a dimensionless number as its 
input, and it produces a dimensionless number as its output. 

As an example of two independent dimensionless groups, let's extend the 
analysis of Problem 5.10, where you predicted the impact speed of a rock 
dropped from a height h. The impact speed depends on g and h, so a reason- 
able choice for the independent dimensionless group is v/Jgh . Therefore, 
its value is a universal, dimensionless constant (which turns out to be -J2). 
To make a second independent dimensionless group, we add a degree of 
freedom to the problem: that the rock is thrown downward with speed v {) 
(the previous problem is the case — 0). 

With this complication, what are independent dimensionless groups? 



Adding a quantity but no new independent dimension creates one more 
independent dimensionless group. Therefore, there are two independent 
groups — for example, v/Jgh and v 0 /Jgh. The most general dimensionless 
statement, which has the form group 1 = /(group 2), is 



v 

W l 



= f 




(5.33) 



This point is as far as dimensional analysis can take us. To go further re- 
quires adding physics knowledge (Problem 5.16). But dimensional analysis 
already tells us that this function / is a universal function: It describes the 
impact speed of every thrown object in the universe, no matter the launch 
velocity, the drop height, or the gravitational field strength. 

Problem 5.16 Impact speed of a thrown rock 

For a rock thrown downward with speed v 0 , use conservation of energy to find the 
form of the dimensionless function / in v/Jgh = f(v 0 /^jgh). 
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Problem 5.17 Nonideal spring 

Imagine a mass connected to a spring with force law 
F cc i 3 (instead of the ideal-spring force F oc x) and 
therefore with potential energy V ~ Cx 4 (where C is a 
constant). Which curve shows how the system's oscil- 
lation period T depends on the amplitude x 0 ? 




Problem 5.18 Rolling down the plane 

In this problem, you use dimensional analysis to simplify 
finding the acceleration of a ring rolling (without slipping) 
down an inclined plane. gT"--. 

a. List the quantities on which the ring's acceleration de- 
pends. Hint: Include the ring's moment of inertia and its radius. Do you also 
need to include its mass? 



b. Form independent dimensionless groups to write the dimensionless statement 

group proportional to a = /(groups not containing a). (5.34) 

c. Does a bigger ring roll faster than a smaller ring? 

d. Does a denser ring roll faster than a less dense ring (of the same radius)? 



5.3.1 Bending of starlight by the Sun 

Our next example of two independent dimensionless groups — the deflec- 
tion of starlight by the Sun — will illustrate how to incorporate physical 
knowledge into the mathematical results from dimensional analysis. 

Rocks, birds, and people feel the effect of gravity. So why not light? The 
analysis of its deflection is a triumph of Einstein's theory of general relativ- 
ity. However, the theory is based on ten coupled, nonlinear partial-differ- 
ential equations. Rather than solving these difficult equations, let's use di- 
mensional analysis. 

Because this problem is more complicated than the previous examples, let's 
organize the complexity by making the steps explicit, including the step of 
incorporating physical knowledge. Divide and conquer! 

1. List the relevant quantities. 

2. Form independent dimensionless groups. 

3. Use the groups to make the most general statement about deflection. 

4. Narrow the possibilities by incorporating physical knowledge. 
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Step 1: Listing relevant quantities 

In the first step, we think of and list the quantities that determine the bend- 
ing. To find them, I often draw a diagram with verbal but without quan- 
titative labels. As the diagram cries out for quantitative labels, it suggests 
quantities for the list. 



apparent 




The bent path means that the star shifts its apparent position. The shift 
is most naturally measured not as an absolute distance but rather as an 
angle 6. For example, if 8 = 180° (n radians), much larger than the likely 
deflection, the star would be shifted halfway around the sky. 



l 3 t -2 



The labels and the list must include our goal, which a 
is the deflection angle 8. Because deflection is pro- Gm 
duced by gravity, we should also include Newton's r 

gravitational constant G and the Sun's mass m (we'll 

use the more general symbol m rather than M Sun , because we may apply the 
formula to light paths around other stellar objects). These quantities could 
join the list as two separate quantities. However, the physical consequences 
of gravity — for example, the gravitational force — depend on G and m only 
through the product Gm. Therefore, let's include just the Gm abstraction 
on the list. (Problem 5.19 shows you how to find its dimensions.) 



angle 

Sun's gravity 
closest approach 



The final quantity on the list is based on our knowledge that gravity gets 
weaker with distance. Therefore, we include the distance from the Sun to 
the light beam. The phrase "distance from the Sun" is ambiguous, because 
the beam is at various distances. Our quantity r will be the shortest distance 
from the center of the Sun to the beam (the distance of closest approach). 



apparent 




mass m 
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Problem 5.19 Dimensions of Gm and G 

Use Newton's law of universal gravitation, F = Gm 1 m 2 /r 2 , to find the dimensions 
of Gm and G. 



Step 2: Form independent dimensionless groups 

The second step is to form independent dimensionless groups. One group 
is easy: The angle 6 is already dimensionless. Unfortunately, three quanti- 
ties and two independent dimensions (L and T) produce only one indepen- 
dent dimensionless group. With only one dimensionless group, the most 
general statement is merely 6 = constant. 

This prediction is absurd. The bending, if it is produced by gravity, has to 
depend on Gm and r. These quantities have to appear in a second dimen- 
sionless group. Creating a second group, we know from the Buckingham 
Pi theorem, requires at least one more quantity. Its absence indicates that 
our analysis is missing essential physics. 

What physics is missing? 



No quantity so far distinguishes between the path 
of light and of, say, a planet. A crucial difference 
is that light travels far more rapidly than a planet. 
Let's represent this important characteristic of light 
by including the c. 
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This quantity, a speed, does not introduce a new independent dimension (it 
uses length and time). Therefore, it increases the number of independent 
dimensionless groups by one, from one to two. To find the new group, first 
check whether any dimension appears in only two quantities. If it does, the 
search simplifies. Time appears in only two quantities: in Gm as T -2 and 
in c as T -1 . To cancel out time, the new dimensionless group must contain 
Gm/c 2 . This quotient contains one power of length, so our dimensionless 
group is Gm/rc 2 . 



As we hoped, the new group contains Gm and r. Its form illustrates again 
that quantities with dimensions are not meaningful alone. For knowing just 
Gm is not enough to decide whether or not gravity is strong. As a quantity 
with dimensions, Gm must be compared to another relevant quantity with 
the same dimensions. Here, that quantity is rc 2 , and the comparison leads 
to the dimensionless ratio Gm/rc 2 . 
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Can we choose other pairs of independent dimensionless groups? 

Yes. For example, 6 and Gmd/rc 2 also make a set of independent dimen- 
sionless groups. Mathematically, all pairs of independent dimensionless 
groups are equivalent, in that any pair can represent any quantitative state- 
ment about light bending. 

However, look ahead to the goal: We hope to solve for 6 . If 6 appears in both 
groups, we will end up with an implicit equation for 6, where 8 appears on 
both sides of the equals sign. Although such an equation is mathematically 
legitimate, it is much harder to think about and to solve than is an explicit 
equation, where 8 is on the left side only. 

Therefore, when choosing independent dimensionless groups, place the 
goal quantity in only one group. This rule of thumb does not remove all 
our freedom in choosing the groups, but it greatly limits the choices. 

Problem 5.20 Physical interpretation of the new group 

Interpret the dimensionless group Gm/rc 2 by multiplying by »hight/ m iight snd re- 
grouping the quantities until you find physical interpretations for the numerator 
and denominator. 



Step 3: Make the most general dimensionless statement 

The third step is to use the independent dimensionless groups to write the 
most general statement about the bending angle. It has the form group 1 = 
/(group 2). Here, 




where / is a universal, dimensionless function. Dimensional analysis can- 
not determine /. However, it has told us that / is a function only of Gm/rc 2 
and not of the four quantities G, m, r, and c separately. That information is 
the great simplification. 



Step 4: Use physical knowledge to narrow the possibilities 

The space of possible functions — here, all nonpathological functions of one 
variable — is vast. Therefore, the fourth and final step is to narrow the pos- 
sibilities for / by incorporating physical knowledge. First, imagine increas- 
ing the effect of gravity by increasing the mass m — which increases Gm/rc 2 . 
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This change should, based on our physical intuition about gravity, also in- 
crease the bending angle. Therefore, f should be a monotonically increas- 
ing function of Gm/rc 2 . Second, imagine an antigravity world, where the 
gravitational constant G is negative. Then the Sun would deflect light away 
from it, making the bending angle negative. In terms of x = Gm/rc 2 , this 
constraint eliminates even functions of x, such as fix') ~ x 2 , which produce 
the same sign for the bending angle independent of the sign of x. 



The simplest function meeting both the monotonicity and sign constraints 
is fix') ~ x. In terms of the second group GM/rc 2 , the form fix) ~ x is 
the dimensionless statement about bending 



6 



Gm 
rc 2 



(5.36) 



All reasonable theories of gravity will predict this relation, because it is 
almost entirely a mathematical requirement. The theories differ only in the 
dimensionless factor hidden in the single approximation sign 

( 1 (simplest guess); 

2 (Newtonian gravity); (5.37) 

4 (general relativity). 



The factor of 2 for Newtonian gravity results from solving for the trajectory 
of a rock roving past the Sun with speed c. The factor of 4 for general relativ- 
ity, double the Newtonian value, results from solving the ten partial-differ- 
ential equations in the limit that the gravitational field is weak. 



► 



How large are these angles? 



Let's first estimate the bending angles closer to home — produced by the 
Earth's gravity. For a light ray just grazing the surface of the Earth, the 
bending angle (in radians!) is roughly 10 . 






6.7 x 10 11 kg 1 m 3 s 2 x 6 x 10 24 kg 



Earth 



6.4 x 10 6 m 



x 10 17 m 2 s 2 



0.7x10" 



(5.38) 



Can we observe this angle? 



The bending angle is the angular shift in the position of the star making 
the starlight. To observe these shifts, astronomers compare a telescope pic- 
ture of the star and surrounding sky, with and without the deflection. A 




158 



5 Dimensions 



telescope with lens of diameter D can resolve angles roughly as small as 
A/D, where A is the wavelength of light. As a result, a lens that can resolve 
0.7 x 10 -9 radians has a diameter of at least 700 meters: 



D 



A 

^Earth 



0.5x10 6 m 
0.7 Xl0“ 9 



700 m. 



(5.39) 



On its own, this length does not mean much. However, the largest telescope 
lens has a diameter of roughly 1 meter; the largest telescope mirror, in a 
different telescope design, is still only 10 meters. No practical lens or mirror 
can be 700 meters in diameter. Thus, there is no way to see the deflection 
produced by the Earth's gravitational field. 

Physicists therefore searched for a stronger source of light bending. The 
bending angle is proportional to m/r. The largest mass in the solar system 
is the Sun. For a light ray grazing the surface of the Sun, r = R Sun and 
m = M Sun . So the ratio of Sun-to-Earth-produced bending angles is 



^Sun _ M Sun ^ / R Sun \ 

^Earth ^Earth V ^ Earth / 



(5.40) 



The mass ratio is roughly 3 x 10 5 ; the radius ratio is roughly 100. The ratio 
of deflection angles is therefore roughly 3000. The required lens diame- 
ter, which is inversely proportional to 9, is correspondingly smaller than 
700 meters by a factor of 3000: roughly 25 centimeters or 10 inches. That 
lens size is plausible, and the deflection might just be measurable. 

Between 1909 and 1916, Einstein believed that a correct theory of gravity 
would predict the Newtonian value of 4.2 x 10 -6 radians or 0.87 arcseconds: 

0.7 x 10~ q rad x 2 x 3000 ~ 4.2 x 10 -6 rad. (5.41) 

~<?Earth Newtonian gravity <9 Stm /<9 Earth 



The German astronomer Soldner had derived the same result in 1803. The 
eclipse expeditions to test his (and Soldner's) prediction got rained out or 
clouded out. By the time an expedition got lucky with the weather, in 1919, 
Einstein had invented a new theory of gravity — general relativity — and it 
predicted a deflection twice as large, or 1.75 arcseconds. 

The eclipse expedition of 1919, led by Arthur Eddington of Cambridge Uni- 
versity and using a 13-inch lens, measured the deflection. The measure- 
ments are difficult, and the results were not accurate enough to decide 
clearly which theory was right. But 1919 was the first year after World War 
One, in which Germany and Britain had fought each other almost to obliv- 
ion. A theory invented by a German, confirmed by an Englishman (from 
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Newton's university, no less) — such a picture was welcome after the war. 
The world press and scientific community declared Einstein vindicated. 

A proper confirmation of Einstein's prediction came only with the advent 
of radio astronomy, which allowed small deflections to be measured accu- 
rately (Problem 5.21). The results, described as the dimensionless factor 
multiplying Gm/rc 2 , were around 4 ± 0.2 — definitely different from the 
Newtonian prediction of 2 and consistent with general relativity. 

Problem 5.21 Accuracy of radio-telescope measurements 

The angular resolution of a telescope is A/D, where A is the wavelength and D is the 
telescope's diameter. But radio waves have a much longer wavelength than light. 
How can measurements of the bending angle that are made with radio telescopes 
be so much more accurate than measurements made with optical telescopes? 

Problem 5.22 Another theory of gravity 

An alternative to Newtonian gravity and to general relativity is the Brans-Dicke 
theory of gravitation [3]. Look up what it predicts for the angle that the Sun 
would deflect starlight. Express your answer as the dimensionless prefactor in 
9 ~ Gm/rc 2 . 



5.3.2 Drag 

Even more difficult than the equations of general relativity, which at least 
have been solved for realistic problems, are the Navier-Stokes equations of 
fluid mechanics. We first met them in Section 3.5, where their complexity 
pushed us to find an alternative to solving them directly. We used a conser- 
vation argument, which we tested using a falling cone, and concluded that 
the drag force on an object traveling through a fluid is 

fdrag ~ PV 2 A CS , (5.42) 

where p is the density of the fluid, v is the speed of the object, and A cs is 
its cross-sectional area. This result is the starting point for our dimensional 
analysis. 

With F drag dependent on p, v, and A cs , there are four quantities and three 
independent dimensions. Therefore, by the Buckingham Pi theorem, there 
is one independent dimensionless group. Our expression for the drag force 
already gives one such group: 
r 

1 drag 

pv 2 A cs 



(5.43) 
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Because the pv 2 in the denominator looks like mv 2 in kinetic energy, a factor 
of one-half is traditionally included in the denominator: 



r 

1 drag 

group 1 = T — — ■ 

2 P® ^cs 

The resulting dimensionless group is called the drag coefficient c d . 



c d = 



^"drag 

\pv 2 A c 



(5.44) 



(5.45) 



As the only dimensionless group, it has to be a constant: There is no other 
group on which it could depend. Therefore, 



^pv A cs 



dimensionless constant. 



(5.46) 



This result is equivalent to our prediction based on conservation of energy, 
that F drag ~ pv 2 A cs . However, trouble happens when we wonder about the 
dimensionless constant's value. 



Dimensional analysis, as a mathematical technique, cannot predict the con- 
stant. Doing so requires physical reasoning, which starts with the obser- 
vation that drag consumes energy. Here, no quantity on which F drag de- 
pends — namely, p, v , or A cs — represents a mechanism of energy loss. There- 
fore, the drag coefficient should be zero. Although this value is consistent 
with the prediction that the drag coefficient is constant, it contradicts all 
our experience of fluids! 

Our analysis needs to include a mechanism of energy loss. In fluids, loss 
is due to viscosity (for an object moving at a constant speed and generat- 
ing no waves). Its physics is the topic of Section 7.3.2, as an example of a 
diffusion constant. For our purposes here, we need just its dimensions in 
order to incorporate viscosity into a dimensionless group: The dimensions 
of kinematic viscosity v are L 2 T _ . (Unfortunately, in almost every font, the 
standard symbols for velocity and kinematic viscosity, v and v , look similar; 
however, even more confusion would result from selecting new symbols.) 

Thus, the kinematic viscosity joins our list. Adding one quantity without 
adding a new independent dimension creates a new independent dimen- 
sionless group. Similarly, in our analysis of the deflection of starlight (Sec- 
tion ), adding the speed of light c created a second independent dimension- 
less group. 
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► 



What is this new group containing viscosity? 



Before finding the group, look ahead to how it will be used. With a second 
group, the most general statement has the form 

group 1 = /(group 2). (5.47) 

Cd 



Our goal is F drag , which is already part of group 1. Including F drag also 
in group 2 would be a bad tactic, because it would produce an equation 
with F drag on both sides and, additionally, wrapped on the right side in the 
unknown function /. Keep F drag out of group 2. 



To make this group from the other quantities v, A cs , p, 
and v, look for dimensions that appear in only one or 
two quantities. Mass appears only in the density; thus, 
the density cannot appear in group 2: If it were part of 
group 2, there would be no way to cancel the dimen- 
sions of mass, and group 2 could not be dimensionless. 



V 


LT -1 


speed 


dcs 


L 2 


area 


p 


ML -3 


fluid density 


V 


L 2 T _1 


viscosity 



Time appears in two quantities: speed v and viscosity v . Because each quan- 
tity contains the same power of time (T _1 ), group 2 has to contain the quo- 
tient v/v — otherwise the dimensions of time would not cancel. 



Here's what we know so far about this dimensionless group: So that mass 
disappears, it cannot contain the density p ; so that time disappears, it must 
contain v and v as their quotient v/v. The remaining task is to make length 
disappear — for which purpose we use the object's cross-sectional area A cs . 
The quotient v/v has dimensions of L -1 , so ^A cs v/v is a new, independent 
dimensionless group. 



Instead of /A cs , usually one uses the diameter D . With that choice, our 
group 2 is called the Reynolds number Re: 




The conclusion from dimensional analysis is then 

drag coefficient — f { Reynolds number ) . 
c~d Re 



(5.49) 



For each shape, the dimensionless function / is a universal function; it de- 
pends on the shape of the object, but not on its size. For example, a sphere, 
a cylinder, and a cone have different functions / spher e, /cylinder/ an d / cone . Sim- 
ilarly, a narrow-angle cone and a wide-angle cone have different functions. 
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But a small and a large sphere are described by the same function, as are 
a large and a small cone of the same opening angle. Any difference in the 
drag coefficients for different-sized objects of the same shape results only 
from the difference in Reynolds numbers (different because the sizes are 
different). 

Here we see the power of dimensional analysis. Reynolds number vD/v 

It shows us that the universe doesn't care about 
the size, speed, or viscosity individually. The uni- 
verse cares about them only through the abstrac- 
tion known as the Reynolds number. It is the only speed v size D viscosity v 
information needed to determine the drag coeffi- 
cient (for a given shape). 

Now let's use this dimensionless framework to analyze the cone experi- 
ment: The experimental data showed that the small and large cones fell 
at the same speed — roughly 1 meter per second. 

What are the corresponding Reynolds numbers? 




The small cone has a diameter of 0.75 x 7 centimeters (0.75 because one-quar- 
ter of the original circumference was removed), which is roughly 5.3 cen- 
timeters. The viscosity of air is roughly 1.5 x 10 -5 square meters per second 
(an estimate that we will make in Section 7.3.2). The resulting Reynolds 
number is roughly 3500: 



v D 

lms" 1 x 0.053m 
1.5 x 10 -5 m 2 s -1 



V 



3500. 



(5.50) 



► 



What is the Reynolds number for the large cone? 



Never make a calculation from scratch when you can use proportional rea- 
soning. Both cones experience the same viscosity and share the same fall 
speed. Therefore, we can consider v and v to be constants, leaving the 
Reynolds number vD/v proportional just to the diameter D. 

Because the large cone has twice the diameter of the small cone, it has twice 
the Reynolds number: 

Re large = 2 x Re smaU * 7000. (5.51) 

At that Reynolds number, what is the cone's drag coefficient? 
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The drag coefficient is 



c d = 



drag 



(5.52) 



-n 7 j 2 A 
2 r air u -^cs 

The cone falls at its terminal speed, so the drag force is also its weight W : 



= W = A„ 



(5.53) 



drag v v pap er ^p ap er $1 

where (r paper is the areal density (mass per area) of paper and A paper is the 
area of the cone template. The drag coefficient is then 



r drag 



c d 



Rpaper Rpaper & 



— /T A 7)^ 

2 r air^cs ^ 

As we showed in Section 3.5.2, 

A, 



3. 

A cs ^ Rpaper* 



(5.54) 



(5.55) 



This proportionality means that the areas cancel out of the drag coefficient: 

Rpaper £ 



C d 



-O X -V 1 
2 r air x 4 u 



(5.56) 



To compute c d , plug in the areal density n paper ~ 80 grams per square meter 
and the measured speed v « 1 meter per second: 



c d 



8x10 2 kg m - x 10ms - 
\ x 1.2 kg m -3 x | x lm 2 s" 2 



1.8. 



(5.57) 



Because no quantity in this calculation depends on the cone's size, both 
cones have the same drag coefficient. (Our estimated drag coefficient is sig- 
nificantly larger than the canonical drag coefficient for a solid cone, roughly 
0.7, and is approximately the drag coefficient for a wedge.) 

Thus, the drag coefficient is independent of Reynolds number — at least, 
for Reynolds numbers between 3500 and 7000. The giant-cone experiment 
of Problem 4.16 shows that the independence holds even to Re ~ 14000. 
Within this range, the dimensionless function f in 

drag coefficient = / cone ( Reynolds number) (5.58) 



is a constant. What a simple description of the complexity of fluid flow! 
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This conclusion for / cone is valid for most shapes. The most extensive drag 
data is for a sphere — plotted below on log-log axes (data adapted from 
Fluid-Dynamic Drag: Practical Information on Aerodynamic Drag and Hydro- 
dynamic Resistance [23]): 




Like the drag coefficient for the cone, the drag coefficient for a sphere is 
almost constant as the Reynolds number varies from 3500 to 7000. The drag 
coefficient holds constant even throughout the wider range 2000. ..3 x 10 5 . 
Around Re « 3 x 10 5 , the drag coefficient falls by a factor of 5, from 0.5 to 
roughly 0.1. This drop is the reason that golf balls have dimples (you will 
explain the connection in Problem 7.24). 

At low Reynolds numbers, the drag coefficient becomes large. This behav- 
ior, which represents a small object oozing through honey, will be explained 
in Section 8. 3. 1.2 using the tool of easy cases. The main point here is that, 
for most everyday flows, where the Reynolds number is in the "few thou- 
sand to few hundred thousand" range, the drag coefficient is a constant that 
depends only on the shape of the object. 

Problem 5.23 Giant cone 

How large, as measured by the diameter of its cross section, would a paper cone 
need to be in order for its fall to have a Reynolds number of roughly 3 x 10 5 (the 
Reynolds number at which the drag coefficient of a sphere drops significantly)? 

Problem 5.24 Reynolds numbers 

Estimate the Reynolds number for (a) a falling raindrop, (b) a flying mosquito, (c) a 
person walking, and (d) a jumbo jet flying at its cruising speed. 
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Problem 5.25 Compound pendulum 

Use dimensional analysis to deduce as much as you can about the 
period T of a compound pendulum — that is, a pendulum where the 
bob is not a point mass but is an extended object of mass m. The 
light rod (no longer a string) of length 1 is fixed to the center of mass 
of the object, which has a moment of inertia I CM about the point of 
attachment. (Assume that the oscillation amplitude is small and 
therefore doesn't affect the period.) 

Problem 5.26 Terminal velocity of a raindrop 

In Problem 3.37, you estimated the terminal speed of a raindrop using 

^"drag ~ Pair^cs^ 7 * (5.59) 

Redo the calculation using the more precise information that c d « 0.5 and retain- 
ing the factor of 4zr/3 in the volume of a spherical raindrop. 




5.4 Temperature and charge 

The preceding examples of dimensional analysis have been mechanical, us- 
ing the dimensions of length, mass, and time. Temperature and charge are 
also essential to describing the world and are equally amenable to dimen- 
sional analysis. Let's start with temperature. 

5.4.1 Temperature 

To handle temperature, do we need to add another fundamental dimension? 

Representing temperature as a new fundamental dimension is one method, 
and the dimension is symbolized by 0. However, there's a simpler method. 
It uses another fundamental constant of nature: Boltzmann's constant k B . 
It has dimensions of energy per temperature; thus, it connects temperature 
to energy. In SI units, k B is roughly 1.6 x 10 -23 joules per kelvin. When a 
temperature T appears, convert it to the corresponding energy k B T (just as 
we convert G, in gravity problems, to Gm). 

For our first temperature example, let's estimate the speed of air molecules. 
We will then use that knowledge to estimate the speed of sound. Because 
the speed v of air molecules is a result of thermal energy, it depends on 
k B T. But v and k B T — two quantities made from two independent dimen- 
sions — cannot make a dimensionless group. We need one more quantity: 
the mass m of an air molecule. 
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Although these three quantities contain three di- v LT -i 
mensions, only two of the dimensions are indepen- ML 2 T -2 

dent — for example, M and LT _1 . Therefore, the m M 
three quantities produce one independent dimen- 
sionless group. A reasonable choice for it is mv 2 /k B T. Then 




thermal speed 
thermal energy 
molecular mass 



(5.60) 



This result is quite accurate. The missing dimensionless prefactor is 43 for 
the root-mean-square speed and ^8/n for the mean speed. 



The dimensional-analysis result helps predict the speed of sound. In a 
gas, sound travels because of pressure, which is a result of thermal motion. 
Thus, it is plausible that the speed of sound c s should be comparable to the 
thermal speed. Then, 




(5.61) 



To evaluate the speed numerically, multiply by a form of 1 based on Avo- 
gadro's number: 



k B T >< N A ' 
m N a ' 



(5.62) 



The numerator now contains k B N A , which is the universal gas constant R: 

8J 



R 



molK’ 



(5.63) 



The denominator in the square root is mN A : the mass of one molecule mul- 
tiplied by Avogadro's number. It is therefore the mass of one mole of air 
molecules. Air is mostly N 2 , with an atomic mass of 28. Including the oxy- 
gen for us animals, the molar mass of air is roughly 30 grams per mole. 
Plugging in 300 K for room temperature gives a thermal speed of roughly 
300 meters per second: 



' 8 J mol -1 K” 



x 300 K 



3 x 10 -2 kg mol 



-l 



300 ms 



(5.64) 



The actual sound speed is 340 meters per second, not far from our estimate 
based on dimensional analysis and a bit of physical reasoning. (The discrep- 
ancy remaining is due to the difference between isothermal and adiabatic 
compression and expansion. For a related effect, later try Problem 8.27.) 
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With this understanding of how to handle temperature, let's use dimen- 
sional analysis to estimate the height of the atmosphere. This height, called 
the scale height H, is the length over which the atmosphere's properties, 
such as pressure and density, change significantly. At the height H, the at- 
mospheric density and pressure will be significantly lower than at sea level. 



The first step in dimensional analysis is to list 
the quantities that determine the height. That 
list requires a physical model: The atmosphere 
is a competition between gravity and thermal 

motion. Gravity drags molecules toward Earth; 

thermal motion spreads them all over the universe. Our list must include 
quantities representing both sides of that competition: m and g for gravity, 
and k B T for thermal energy. 



H 


L 


atmosphere height 


8 


LT -2 


gravity 


hT 


ML 2 T -2 


thermal energy 


m 


M 


molecular mass 



Four quantities (including the goal H) built from three independent dimen- 
sions produce one independent dimensionless group. A reasonable choice 
for the group — reasonable because it is proportional to the goal H — is the 
ratio mgH /k B T. Therefore, the atmosphere's scale height is given by 



k^T 
mg ' 



( 5 . 65 ) 



To evaluate this height numerically, again convert Boltzmann's constant k B 
to the universal gas constant R by multiplying by N A /N A : 



hN A T 

mN A g 



SJmol^K -1 x 300 K 
— « 8 km. 

3xl0 -2 kgmol 1 x 10 ms -2 



( 5 . 66 ) 



^ molar 



Thus, by 8 kilometers above sea level, atmospheric pressure and density 
should be significantly lower than at sea level. This conclusion is reason- 
able: Mount Everest is 9 kilometers high, where the significantly thinner 
air requires climbers to carry oxygen tanks. 

As a rule of thumb for a decaying function such as atmospheric pressure 
or density, a "significant change" can be estimated as a rise or, here, a fall 
by a factor of e. (For more on this rule of thumb, try Problem 6.36 after you 
have studied lumping; see also Section 3.2.1 of Street-Fighting Mathematics 
[33].) Indeed, at 8 kilometers, the standardized atmospheric parameters 
relative to sea level are p/p 0 ~ 0.43 and p/p 0 ~ 0.35. For comparison, 1/e is 
approximately 0.37. 
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5.4.2 Charge 

Our second new dimension is charge, symbolized by Q. By introducing 
charge, we can apply dimensional analysis to electrical phenomena. Let's 
begin by finding the dimensions of electrical quantities. 

What are the dimensions of current, voltage, and resistance? 



Current is the flow of charge; it has dimensions of charge per time. Using 
I to denote current and [I] to denote its dimensions, 

[I] = QT” 1 . (5.67) 



The dimensions of voltage can be determined from the relation 
voltage x current = power. 



(5.68) 



The corresponding dimensional equation is 
[power] 



[voltage] = 



[current] 



(5.69) 



Because the dimensions of power (energy per time) are ML 2 T , and the 
dimensions of current are QT _1 , the dimensions of voltage are ML 2 T~ Q _ 1 : 

[voltage] = ML T 1 - = ML 2 T“ 2 Q _1 . (5.70) 

QT” 1 



In this expression, the dimensional combination ML 2 T -2 is energy. There- 
fore, voltage is also energy per charge. As a result, an electron volt (1 elec- 
tron charge times 1 volt) is an energy. A few electron volts is the energy 
in chemical bonds. Because chemical bonds reflect a change in one or two 
electrons, atomic and molecular voltages must be a few volts. (An everyday 
consequence is that typical batteries supply a few volts.) 



To find the dimensions of resistance, we'll write Ohm's law as a dimen- 
sional equation: 



[resistance] = 



[voltage] 

[current] 



ML 2 T~ 2 Q~ 1 

QT -1 



= ML 2 T _1 Q“ 2 . 



(5.71) 



To simplify these dimensions, use [ V] — the dimensions of voltage — as an 
abstraction for the dimensional mess ML 2 T _2 Q _1 . Then, 

[resistance] = [V] x TQ -1 . (5.72) 



This alternative is useful when the input to and output of a circuit is volt- 
age. Then treating the dimensions of voltage as a fundamental dimension 
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simplifies the task of finding dimensionless groups. (For practice in using 
the dimensions of voltage, try Problem 5.27.) 



Problem 5.27 Capacitance and inductance 

Express the dimensions of capacitance and inductance using L, M, T, and Q. Then 
write them using [V] (the dimensions of voltage) along with any fundamental 
dimensions that you need. 



Problem 5.28 RC circuit 

In this problem, you apply dimensional analysis 
to the low-pass RC circuit that we introduced in 1 
Section 2.4.4. In particular, make the input volt- 
age V in zero for time t < 0 and a fixed voltage V 0 
for t > 0. The goal is the most general dimension- 
less statement about the output voltage V, which 
depends on V 0 , t, R, and C. 



V 



ground 



a. Using [V] to represent the dimensions of voltage, fill in a dimensional-analysis 
table for the quantities V , V 0 , t, R, and C. 

b. How many independent dimensions are contained in these five quantities? 

c . Form independent dimensionless groups and write the most general statement 
in the form 

group containing V but not t =/(group containing t but not V , ...), (5.73) 

where the ... stands for the third dimensionless group (if it exists). Compare 
your expression to the analysis of the RC circuit in Section 2.4.4. 



Problem 5.29 Dimensional analysis of an LRC circuit 

In the LRC circuit, the input voltage V in is the real L ( 

part of a complex-exponential signal Vge^*, where Pm — TTTT) — | 

V 0 is the input amplitude and cv is the angular os- 
cillation frequency. The output voltage V out is then 
the real part of V 1 e- ,a ’ i , where W is the (possibly 
complex) output amplitude. What quantities deter- 
mine the gain G, defined as the ratio Vi/V 0 7 What is a good set of independent 
dimensionless groups built from G and these quantities? 




Along with the potential (or voltage) V , a leading actor in electromagnetics 
is the electric field. (Once you understand how to handle electric fields, 
you can practice by analyzing magnetic fields — try Problems 5.31, 5.32, and 
5.33.) Electric fields not only transmit force; they also contain energy. This 
energy is important, not least because its transport is how the Sun warms 
the Earth. We will estimate the energy contained in electric fields by using 
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dimensional analysis. This analysis will give us the result that we used in 
Section 2.4.2 to estimate, by analogy, the energy in a gravitational field. Like 
the gravitational field, the electric field extends over space. Therefore, we 
usually want not the energy itself, but rather the energy per volume — the 
energy density. 

What is the energy density in an electric field? 



To answer this question with dimensional analysis, let's follow most of the 
steps that we used in estimating the bending of starlight (Section 5.3.1). 

1. List the relevant quantities. 

2. Form independent dimensionless groups. 

3. Use the groups to make the most general statement about the energy 
density. 

4. Narrow the possibilities by incorporating physical knowledge. For this 
problem, we will be able to skip this step. 



Thus, the first step is to tabulate the quantities on which our goal, the energy 
density <5, depends. It definitely depends on the electric field E. It also prob- 
ably depends on e Q , the permittivity of free space appearing in Coulomb's 
law, because most results in electrostatics contain e 0 . But <5 shouldn't de- 
pend on the speed of light: The speed of light suggests radiation, which 
requires a changing electric field; however, even a constant electric field 
contains energy. Thus, our list might be complete. Let's see, by trying to 
make a dimensionless group. 



e 

E 

£ o 



ML' 



-lrjH— 2 



MLT“ 2 Q“ 



So we list the quantities with their dimensions, 
with the goal at the head of the table. Energy 
density is energy per volume, so its dimensions 
are ML -1 T -2 . Electric field is force per charge 
(just as gravitational field is force per mass — analogy!), so its dimensions 
are MLT“ 2 Q _1 . 



M -1 L“ 



3 t 2q2 



energy density 
electric field 
SI constant 



What are the dimensions ofe 0 ? 



This quantity is the trickiest. It shows up in Coulomb's law: 



electrostatic force = 



\ne> 



o r 2 



( 5 . 74 ) 



As a dimensional equation. 
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[e 0 ] 



Q 2 

[F] L 2 ' 



(5.75) 



where [F] = MLT 2 represents the dimensions of force. Therefore, e (J has 
dimensions of M _1 L _, T 2 Q 2 . 



The second step is to find independent dimensionless groups. Any search 
is aided by knowing how many items to find. This count is provided by the 
Buckingham Pi theorem. To apply the theorem, we first need to count the 
independent dimensions. 



► 



How ninny independent dimensions do the three quantities contain? 



At first glance, the quantities contain four independent dimensions: M, L, 
T, and Q. However, the Buckingham Pi theorem would then predict —1 
dimensionless groups, which is nonsense. Indeed, the number of indepen- 
dent dimensions cannot exceed the number of quantities (a restriction that 
you explained in Problem 5.7). Here, as you verify in Problem 5.30, there are 
only two independent dimensions — for example, MLT Q (the dimen- 
sions of electric field) and L 2 Q -1 . 

Three quantities constructed from two independent dimensions produce 
one independent dimensionless group. A useful choice for this group, be- 
cause it is proportional to the goal S, is G/e 0 E 2 . 

The third step is to use the independent dimensionless group to make the 
most general statement about the energy density. With only one group, the 
most general statement is 

G ~ e 0 E 2 . (5.76) 

The fourth step is to narrow the possibilities by using physical knowledge. 
With only one independent dimensionless group, the space of possibilities 
is already narrow. The only freedom is the dimensionless prefactor hidden 
in the single approximation sign it turns out to be 1/2. In Section 5.4.3, 
the scaling G oc E 2 will help us explain the surprising behavior of the elec- 
tric field produced by an accelerating charge — and thereby explain why 
stars are visible and radios work. 

Problem 5.30 Rewriting the dimensions 

Express the dimensions of 8, E, and e 0 in terms of [E] (the dimensions of electric 
field) and L 2 Q -1 . Thus, show that the three quantities contain only two indepen- 
dent dimensions. 
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Problem 5.31 Dimensions of magnetic field 

A magnetic field B produces a force on a moving charge q given by 

F = q (v x B), (5.77) 

where v is the charge's velocity. Use this relation to find the dimensions of mag- 
netic field in terms of M, L, T, and Q. Therefore, give the definition of a tesla, the 
SI unit of magnetic field. 

Problem 5.32 Magnetic energy density 

Just as electric fields depend on e 0 , the permittivity of free space, magnetic fields 
depend on the constant }i 0 called the permeability of free space. It is defined as 

fi 0 = 4 77 x 10 -7 N A -2 , (5.78) 

where N is a newton and A is an ampere (a coulomb per second). Express the 
dimensions of ;; 0 in terms of M, L, T, and Q. 

Then use the dimensions of magnetic field (Problem 5.31) to find the energy den- 
sity in a magnetic field B. (As with the electric field, the dimensionless prefactor 
will be 1/2, but dimensional analysis does not give us that information.) What 
analogies can you make between electrostatics and magnetism? 

Problem 5.33 Magnetic field due to a wire 

Use the dimensions of B (Problem 5.31) and of the permeability of free space 
(Problem 5.32) to find the magnetic field B at a distance r from an infinitely long 
wire carrying a current I . The missing dimensionless prefactor, which dimensional 
analysis cannot tell us, turns out to be 1/27T. 

Magnetic-resonance imaging (MRI) machines for medical diagnosis use fields of 
the order of 1 tesla. If this field were produced by a current-carrying wire 0.5 me- 
ters away, what current would be required? Therefore, explain why these magnetic 
fields are produced by superconducting magnets. 

Problem 5.34 Fields due to a uniform sheet of charge 

Imagine an infinite, uniform sheet of charge containing a charge per area cr. Use 
dimensional analysis to find the electric field E at a distance z from the sheet. (The 
missing dimensionless prefactor turns out to be 1!) Therefore, give the scaling 
exponent n in £ oc z". (In Problem 6.21, you'll investigate a physical explanation 
for this surprising result.) 

Use the analogy between electric and gravitational fields (Section 2.4.2) to find the 
gravitational field above a uniform sheet with mass per area cr. 



5.4.3 Power radiated by a moving charge 

In our next example, we'II estimate the power radiated by a moving charge, 
which is how a radio broadcasting antenna works. This power, when com- 
bined with our long-standing understanding of flux (Section 3.4.2) and our 
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new understanding of the energy density in an electric field (Section 5.4.2), 
will predict a surprising scaling relation for the strength of the radiation 
field, one responsible for our ability to see the world. The example also 
illustrates a simple way to manage the dimensions of charge. 

How does the power radiated by a moving charge depend on the acceleration of the 
charge? 

For the moment, let's ignore the charge's velocity. Most likely, the depen- 
dence on acceleration is a scaling relation 

power oc (acceleration)". (5.79) 

We will find the scaling exponent n using dimensional analysis. 

First, we list the quantities on which the goal, the radiated power P, de- 
pends. It certainly depends on the charge's motion, which is represented 
by its acceleration a. It also depends on the amount of charge q, because 
more charge probably means more radiation. The list should also include 
the speed of light c, because the radiation travels at the speed of light. 

It probably also needs the permittivity of free space e 0 - However, rather 
than including c () directly, let's reuse the shortcut for gravitation, where we 
combined Newton's constant G with a mass (in Section 5.3.1). Similarly, e 0 
always appears as c/ 2 /47tc () , whether in electrostatic energy or force. There- 
fore, rather than including q and e 0 separately, we can include only q 2 /4ne 0 . 

What happened to the charge's velocity? 

If the radiated power depended on the velocity, then we could use the prin- 
ciple of relativity to make a perpetual- motion machine: We generate en- 
ergy simply by using a different (inertial) reference frame, one in which the 
charge moves faster. Rather than believing in perpetual motion, we should 
conclude that the velocity does not affect the power. Equivalently, we can 
eliminate the velocity by switching to a reference frame where the charge 
has zero velocity. 

Why doesn't the reference-frame argument allow us to eliminate the acceleration? 

It depends on changing to another inertial reference frame — that is, a frame 
that moves at a constant velocity relative to the original frame. This relative 
motion does not affect the charge's acceleration, only its velocity. 
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However, if we change to a noninertial — an accelerating — reference frame, 
we must modify the equations of motion, adding terms for the Coriolis, cen- 
trifugal, and Euler forces. (For more on reference frames, a wonderful expo- 
sition is John Taylor's Classical Mechanics [45].) If we switch to a noninertial 
frame, all bets are off about the radiated power. In summary, acceleration 
is different from velocity. 



q 2 /4ne 0 



ML 2 T “ 3 

ML 3 T “ 2 



LT 

LT 



.-l 

-2 



These four quantities, built from three inde- 
pendent dimensions, produce just one inde- 
pendent dimensionless group. Because mass 
appears in only P and c] 2 /4?T£ 0 (as M 1 ), the 
group must contain P/(c] 2 /4ne 0 ). This quo- 
tient has dimensions of L _1 T _ . Multiplying it by c 3 /a 2 , which has dimen- 
sions of LT, makes the result dimensionless. Therefore, the independent 
dimensionless group proportional to P is 



radiated power 
electrostatic mess 
speed of light 
acceleration 



q 2 /4ne 0 a 2 



(5.80) 



As the only group, it must be a dimensionless constant, so 



4ne, 



o c° 



(5.81) 



The exact result is almost identical: 



P = 



67re 0 c 3 ’ 



(5.82) 



This result is even correct at relativistic speeds, as long as we use the rela- 
tivistic acceleration (the four-acceleration) as the generalization of a. 

As a scaling relation between P and a, it is simple: P oc a 2 . Doubling the 
acceleration quadruples the radiated power. The steep dependence on the 
acceleration is an important part of the reason that the sky is blue (we'll do 
the analysis in Section 9.4.1). 

This power is carried by a changing electric field. By using the energy den- 
sity in the electric field, we can estimate and explain the surprising strength 
of this field. We start by estimating the energy flux (the power per area) at 
a distance r, by spreading the radiated power over a sphere of radius r . The 
sphere's surface area is comparable to r 2 , so 

P_ ~ q 2 a 2 l_ 
r 2 47 rc 0 c 3 r 2 ’ 



energy flux 



(5.83) 
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Energy flux, based on what we learned in Section 3.4.2, is connected to en- 
ergy density by 

energy flux = energy density x transport speed. (5.84) 



The transport speed is the speed of light c, so 



a 2 1 



47re 0 c 3 r 2 



e 0 E 2 



c. 



energy flux energy density speed 

Now we can solve for the electric field: 
e 0 c 2 r 



(5.85) 



(5.86) 



As a scaling relation, E ex r . Compare it to the —2 scaling exponent for 
an electrostatic field, where E oc r~ 2 (Coulomb's law). The radiation field 
therefore falls off more slowly with distance than the electrostatic field. This 
important difference explains why we can receive radio signals and see 
stars. If radiation fields were proportional to 1/r 2 , stars, and most of the 
world, would be invisible. What difference a scaling exponent can make! 



5.5 Atoms, molecules, and materials 

With our growing repertoire of dimensional analyses, we can explore ever 
more of the world. Perhaps the most fundamental property of the world 
is that it is composed of atoms. Feynman argued for the importance of the 
atomic theory in his famous lectures on physics [14, Vol. 1, p. 1-2]: 

If, in some cataclysm, all of scientific knowledge were to be destroyed, and 
only one sentence passed on to the next generation of creatures, what statement 
would contain the most information in the fewest words? I believe it is the atomic 
hypothesis (or the atomic fact, or whatever you wish to call it) that all things are 
made of atoms — little particles that move around in perpetual motion, attracting each 
other when they are a little distance apart, but repelling upon being squeezed into one 
another. In that one sentence, you will see, there is an enormous amount of infor- 
mation about the world.... [emphasis in original] 

The atomic theory was first stated over 2000 years ago by the ancient Greek 
philosopher Democritus. Using quantum mechanics, we can predict the 
properties of atoms in great detail — but the analysis involves complicated 
mathematics that buries the core ideas. By using dimensional analysis, we 
can keep the core ideas in sight. 
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5.5.1 Dimensional analysis of hydrogen 



We'll study the simplest atom: hydrogen. Dimensional analysis will ex- 
plain its size, and its size will in turn explain the size of more complex 
atoms and molecules. Dimensional analysis will also help us estimate the 
energy needed to disassemble a hydrogen atom. This energy will in turn 
explain the bond energies in molecules. These energies will explain the 
stiffness of materials, the speed of sound, and the energy content of fat and 
sugar — all starting from hydrogen! 



In the dimensional analysis of the size of hydrogen, the zeroth step is to give 
the size a name. The usual name and symbol for the radius of hydrogen is 
the Bohr radius a 0 . 



The first step is to find the quantities that determine the size. That deter- 
mination requires a physical model of hydrogen. A simple model is an 
electron orbiting a proton at a distance a 0 . Their electrostatic attraction 
provides the force F holding the electron in orbit: 



F = 



e 2 1 
^ ne 0 



( 5 . 87 ) 




where e is the electron charge. Our list of quantities should include the 
quantities in this equation; in that way, we teach dimensional analysis about 
what kind of force holds the atom together. Thus, we should somehow in- 
clude e and e 0 . As when we estimated the power radiated by an accelerating 
charge (Section 5.4.3), we'll include e and e 0 as one quantity, e 2 /47TC 0 . 

The force on the electron does not by itself deter- L size 

mine the electron's motion. Rather, the motion is £ i /\ neQ ML 3 T -2 electrostatics 

determined by its acceleration. Computing the m M electron mass 

acceleration from the force requires the electron 

mass m e , so our list should include m e . 

These three quantities made from three independent dimensions produce 
zero dimensionless groups (another application of the Buckingham Pi the- 
orem introduced in Section 5.1.2). Without any dimensionless groups, we 
cannot say anything about the size of hydrogen. The lack of a dimension- 
less group tells us that our simple model of hydrogen is too simple. 

There are two possible resolutions, each involving new physics that adds 
one new quantity to the list. The first possibility is to add special relativ- 
ity by adding the speed of light c. That choice produces a dimensionless 
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group and therefore a size. However, this size has nothing to do with hydro- 
gen. Rather, it is the size of an electron, considering it as a cloud of charge 
(Problem 5.37). 



The other problem with this approach is that electromagnetic radiation trav- 
els at the speed of light, so once the list includes the speed of light, the elec- 
tron might radiate. As we found in Section 5.4.3, the power radiated by an 
accelerating charge is 

P ~ - - — mr- ( 5 . 88 ) 



4710 , 



0 



An orbiting electron is an accelerating electron (accelerating inward with 
the circular acceleration a = v 2 /a 0 ), so the electron would radiate. Radia- 
tion would carry away energy from the electron, the electron would spiral 
into the proton, and hydrogen would not exist — nor would any other atom. 
So adding the speed of light only compounds our problem. 

The second resolution is instead to add quantum mechanics. Its fundamen- 
tal equation is the Schrodinger equation: 

(“£ V2 + y ) !/ ’ = E!/ ’- (5 ' 89) 



Most of the symbols in this partial-differential equation are not important 
for dimensional analysis — this disregard is how dimensional analysis sim- 
plifies problems. For dimensional analysis, the crucial point is that Schro- 
dinger's equation contains a new constant of nature: h, which is Planck's 
constant h divided by 2tt . We can include quantum mechanics in our model 
of hydrogen simply by including h on our list of relevant quantities. In that 
way, we teach dimensional analysis about quantum mechanics. 

The new quantity h is an angular momentum, which is a length (the lever 
arm) times linear momentum ( mv ). Therefore, its dimensions are 

x x LT" 1 = ML 2 T _1 . ( 5 . 90 ) 

M [m] [i>] 



The h might save hydrogen. It contributes a 


a o 


L 


size 


fourth quantity without a fourth independent 


e 2 /4ne 0 


ML 3 T “ 2 


electrostatic mess 


dimension. Therefore, it creates an indepen- 


m e 


M 


electron mass 


dent dimensionless group. 


h 


ml 2 t _1 


quantum 



To find it, look for a dimension appearing in 

only two quantities. Time appears in e 2 /47re 0 as T~ 2 and in h as T . There- 
fore, the group contains the quotient h 2 /(e 2 /Ane 0 ). Its dimensions are ML, 
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which can be canceled by dividing by m e a 0 . The resulting dimensionless 
group is 



m e fl 0 (e 2 /47re 0 ) ' 



As the only independent dimensionless group, it must be a dimensionless 
constant. Therefore, the size (as the radius) of hydrogen is given by 



«o 



h 2 

m e (e 2 /47re 0 ) 



(5.92) 



Now let's plug in the constants. We could just look up each constant, but 
that approach gives exponent whiplash; the powers of ten swing up and 
down and end on a seemingly random value. To gain insight, we'll instead 
use the numbers for the backs of envelopes (p. xvii). That table deliberately 
has no entry for h by itself, nor for the electron mass m e . Both omissions 
can be resolved with a symmetry operation (Problem 5.35). 



Problem 5.35 Shortcuts for atomic calculations 

Many atomic problems, such as the size or binding energy of hydrogen, end up in 
expressions with h, the electron mass m e , and r 2 /47re 0 . You can avoid remember- 
ing those constants by instead remembering the following values: 



he « 200 eV nm = 2000 eV A 
e c 2 ~ 0.5 x 10 6 eV (the electron rest energy) 



e 2 /4 ne 0 
he 



(5.93) 



= a 



1/137 (the fine-structure constant). 



Use these values and dimensional analysis to find the energy of a photon of green 
light (which has a wavelength of approximately 0.5 micrometers), expressing your 
answer in electron volts. 



In our expression for the Bohr radius, h 2 /m e (e 2 /4TTe 0 ), we can replace h 
with he and m e with m e c 2 simultaneously: Multiply by 1 in the form c 2 /c 2 ; 
it is a convenient symmetry operation. 



a o 



h 2 



X — = 



(i he) 7 



m e (e 2 /Ane 0 ) c 2 m e c 2 (e 2 /47re 0 ) 



(5.94) 



Now the table provides the needed values: for he, m e c 2 , and e 2 /47rc () . How- 
ever, we can make one more simplification because the electrostatic mess 
e 2 /47re 0 is related to a dimensionless constant. To see the relation, multiply 
and divide by he: 
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47re n 



e 2 /47re 0 A 

he J 



he. 



(5.95) 



The factor in parentheses is known as the fine-structure constant a; it is a 
dimensionless measure of the strength of electrostatics, and its numerical 
value is close to 1/137 or roughly 0.7 x 10 -2 . Then 

e 2 

= tthc. (5.96) 

47TC 0 

This substitution further simplifies the size of hydrogen: 

(he) 2 (he) 2 he 



«o 



m e c 2 x e 2 /47T6 0 m e c 2 x ochc a x m e° 2 



(5.97) 



Only now, having simplified the calculation down to abstractions worth 
remembering (a, m e c 2 , and he), do we plug in the entries from the table. 



he 



a 0 



200 eV nm 



0.7 x 10 -2 x 5 x 10 5 eV 



(5.98) 



This calculation we can do mentally. The units of electron volts cancel, leav- 
ing only nanometers (nm). The two powers of ten upstairs (in the numer- 
ator) and the three powers of ten downstairs (in the denominator) result 
in 10 _1 nanometers or 1 angstrom (10 -10 meters). The remaining factors 
result in a factor of 1 /2: 2/(0 .7 x 5) « 1/2. 

Therefore, the size of hydrogen — the Bohr radius — is about 0.5 angstroms: 



a 0 ~ 0.5 x 10 -10 m = 0.5 A. 



(5.99) 



Amazingly, the missing dimensionless prefactor is 1. Thus, atoms are ang- 
strom-sized. Indeed, hydrogen is 1 angstrom in diameter. All other atoms, 
which have more electrons and therefore more electron shells, are larger. A 
useful rule of thumb is that a typical atomic diameter is 3 angstroms. 

What binding energy does this size produce? 



The binding energy is the energy required to disassemble the atom by re- 
moving the electron and dragging it to infinity. This energy, denoted E 0 , 
should be roughly the electrostatic energy of a proton and electron sepa- 
rated by the Bohr radius a 0 : 
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p 

£f) ~ . 

Ane 0 n 0 

With our result for a 0 , the binding energy becomes 
The missing dimensionless prefactor is just 1/2: 

Problem 5.36 Shortcut to calculate the binding energy of hydrogen 

Use the shortcuts in Problem 5.35 to show that the binding energy of hydrogen is 
roughly 14 electron volts. From your symbolic expression for the energy, which is 
also the kinetic energy of the electron, estimate its speed as a fraction of c. 

In Problem 5.36, you showed, using the values of he, m e c 2 , and a., that this 
energy is roughly 14 electron volts. For the sake of a round number, let's 
call the binding energy roughly 10 electron volts. 

This energy sets the scale for chemical bonds. In Section 3.2.1, we calcu- 
lated, by unit conversion, that 1 electron volt per molecule corresponded 
to roughly 100 kilojoules per mole. Therefore, breaking a chemical bond 
requires roughly 1 megajoule per mole (of bonds). As a rough estimate, it 
is not far off. For example, if the molecule consists of a few life-related 
atoms (carbon, oxygen, and hydrogen), then the molar mass is roughly 
50 grams — a small jelly donut. Therefore, burning a jelly donut, as our 
body does slowly when we eat the donut, should produce roughly 1 mega- 
joule — a useful rule of thumb and way of imagining a megajoule. 

Problem 5.37 Adding the speed of light 

If we assume that a 0 depends on e 2 /4 ne 0 , m e , and c, what size would dimensional 
analysis predict for hydrogen? This size is called the classical electron radius and 
denoted r 0 . How does it compare to the actual Bohr radius? 

Problem 5.38 Thermal expansion 

Estimate a typical thermal-expansion coefficient, also denoted a. It is defined as 

fractional change in a substance's length 

a = 5 ( 5 . 103 ) 

change m temperature 

The thermal-expansion coefficient depends on the binding energy E 0 . Assuming 
Eg ~ 10 eV, compare your estimate for a with its value for everyday substances. 



(5.100) 



(5.101) 



(5.102) 
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Problem 5.39 Diffraction in the eye 

Light passing through an opening, such as a telescope 
aperture or the pupil in the eye, diffracts (spreads out). 

By estimating the diffraction angle 9, we will be able 
to understand aspects of the design of the eye. 

a. What are the valid dimensionless relations connect- 
ing the diffraction angle 9, the light wavelength A, 
and the pupil diameter D? 

b. Diffraction is the result of the photons in the beam acquiring a vertical momen- 
tum Ap ir What is the scaling exponent /3 in 9 oc (A p v )^? 

c. The Heisenberg uncertainty principle from quantum mechanics says that the 
uncertainty in the photon's vertical momentum A p v is inversely proportional 
to the pupil diameter D. What is therefore the scaling exponent j in 9 oc D 7 ? 

d. Therefore find 9 as a function of A and D. 

e. Estimate a pupil diameter and the resulting diffraction angle. The light-sensi- 
tive cells in the retina that we use in bright light are the cones. They are most 
dense in the fovea — the central region of the retina that we use for reading and 
any other task requiring sharp vision. At that location, their density is roughly 
0.5 x 10 7 per square centimeter. Is that what you would predict? 

Problem 5.40 Kepler's third law for non-inverse-square force laws 

With an inverse-square force, Kepler's third law — the relation between orbit radius 
and period — is T oc r 3/2 (Section 4.2.2). Now generalize the law to forces of the 
form F oc r n , using dimensional analysis to find the scaling exponent /3 in the 
orbital period T oc as a function of the scaling exponent n in the force law. 

Problem 5.41 Ground state energy for general potentials 

Just as we can apply dimensional analysis to classical orbits (Problem 5.40), we can 
apply it to quantum orbits. When the force law is F oc r -2 (electrostatics), or the 
potential is V oc r~ l , we have hydrogen, for which we estimated the binding or 
ground-state energy. Now generalize to potentials where V = CrP. 

The relevant quantities are the ground-state energy E 0 , the proportionality con- 
stant C in the potential, the quantum constant h, and the particle's mass m . These 
four quantities, built from three dimensions, form one independent dimensionless 
group. But it is not easy to find. Therefore, use linear algebra to find the exponents 
7 , &, and e that make E 0 C 7 h i m £ dimensionless. In the case /3 = — 1 (electrostatics), 
check that your result matches our result for hydrogen. 




5.5.2 Blackbody radiation 

With quantum mechanics and its new constant h, we can explain the surface 
temperatures of planets or even stars. The basis is blackbody radiation: A 
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hot object — a so-called blackbody — radiates energy. (Here, "hot" means 
warmer than absolute zero, so every object is hot.) Hotter objects radiate 
more, so the radiated energy flux F depends on the object's temperature T. 

How are the energy flux F and the surface temperature T connected? 



I stated the connection in Section 4.2, but now we know enough dimen- 
sional analysis to derive almost the entire result without a detailed study 
of the quantum theory of radiation. Thus, let's follow the steps of dimen- 
sional analysis to find F, and start by listing the relevant quantities. 



The list begins with the goal, the energy flux F. Blackbody radiation can 
be understood properly through the quantum theory of radiation, which 
is the marriage of quantum mechanics to classical electrodynamics. (A dia- 
gram of the connection, which requires the subsequent tool of easy cases, is 
in Section 8.4.2.) For the purposes of dimensional analysis, this theory sup- 
plies two constants of nature: h from quantum mechanics and the speed 
of light c from classical electrodynamics. The list also includes the object's 
temperature T, so that dimensional analysis knows how hot the object is; 
but we'll include it as the thermal energy k B T. 



The four quantities built from three independent 
dimensions produce one independent dimension- 
less group. Our usual route to finding this group, 
by looking for dimensions that appear in only one 
or two variables, fails because mass, like length, ap- 
pears in three quantities, and time appears in all four. An alternative is to 
apply linear algebra, as you practiced in Problem 5.41. But it is messy. 



F 


MT“ 3 


energy flux 


k B T 


ML 2 T“ 2 


thermal energy 


h 


ml 2 t _1 


quantum 


c 


LT -1 


speed of light 



A less brute-force and more physically meaningful alternative is to choose 
a system of units where c = 1 and h = 1. Each of these two choices has 
a physical meaning. Before using the choices, it is worth understanding 
these meanings. Otherwise we are back to pushing symbols around. 

The first choice, c = 1, expresses the unity of space and time embedded and 
embodied in Einstein's theory of special relativity. With c = 1, length and 
time have the same dimensions and are measured in the same units. We 
can then say that the Sun is 8.3 minutes away from the Earth. That time is 
equivalent to the distance of 8.3 light minutes (the distance that light travels 
in 8.3 minutes). 



The choice c = 1 also expresses the equivalence between mass and energy. 
We use this choice implicitly when we say that the mass of an electron is 
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5 x 10 5 electron volts — which is an energy. The complete statement is that, 
in the usual units, the electron's rest energy m e c 2 is 5 x 10 5 electron volts. 
But when we choose c = 1, then 5 x 10 5 electron volts is also the mass m e . 

The second choice, h = 1, expresses the fundamental insight of quantum 
mechanics, that energy is (angular) frequency. The complete connection 
between energy and frequency is 

E = hco. ( 5 . 104 ) 

When h — 1, then E is co. 

These two unit choices implicitly incorporate their physical meanings into 
our analysis. Furthermore, the choices so simplify our table of dimensions 
that we can find the proportionality between flux and temperature without 
any linear algebra. 

First, choosing c = 1 makes the dimensions of length and time equivalent: 
c = 1 implies L = T. ( 5 . 105 ) 

Thus, in the table we can replace T with L. 

Then adding the choice h = 1 contributes the dimensional equation 

ML 2 T“ 2 = T -1 ( 5 . 106 ) 

[E] M 

It looks like a useless mess; however, replacing T with L (from c = 1) sim- 
plifies it: 

M = L -1 . ( 5 . 107 ) 

In summary, setting c = 1 and h = 1 makes length and time equivalent 
dimensions and makes mass and inverse length equivalent dimensions. 

With these equivalences, let's rewrite the table of dimensions, one quantity 
at a time. 

1. Flux F. Its dimensions, in the usual system, were MT _ . In the new 
system, T -3 is equivalent to M 3 , so the dimensions of flux become M 4 . 

2. Thermal energy k B T. Its dimensions were ML T . In the new system, 
T -2 is equivalent to L -2 , so the dimensions of thermal energy become 
M. And they should: Choosing c = 1 makes energy equivalent to mass. 

3. Quantum constant h. By fiat (when we chose h = 1), it is now dimension- 
less and just 1, so we do not list it. 

4. Speed of light c. Also by fiat, c is just 1, so we do not list it either. 




184 



5 Dimensions 



Our list has become shorter and the dimensions in it p M 4 ener gy fl ux 
simpler. In the usual system, the list had four quan- M thermal energy 

tities (F, k B T, h, and c) and three independent dimen- 

sions (M, L, and T). Now the list has only two quantities (F and k B T) and 
only one independent dimension (M). In both systems, there is only one 
dimensionless group. 

In the usual system, the Buckingham Pi calculation is as follows: 

4 quantities 

— 3 independent dimensions (5.108) 

1 independent dimensionless group 

In the new system, the calculation is 

2 quantities 

— 1 independent dimension (5.109) 

1 independent dimensionless group 



(If the number of independent dimensionless groups had not been the same, 
we would know that we had made a mistake in converting to the new sys- 
tem.) In the simpler system, the independent dimensionless group propor- 
tional to F is now almost obvious. Because F contains M 4 and k B T contains 
M 1 , the group is just F/(k B T) 4 . 



Alas, no lunch is free and no good deed goes unpunished. Now we pay 
the price for the simplicity: We have to restore c and h in order to find the 
equivalent group in the usual system of units. Fortunately, the restoration 
doesn't require any linear algebra. 

The first step is to compute the usual dimensions of F/(k B T) 4 : 




MT ~ 3 

(ML 2 T “ 2 ) 4 



= M“ 3 L“ 8 T 5 . 



(5.110) 



The rest of the analysis for making this quotient dimensionless just rolls 
downhill without our needing to make any choices. The only way to get 
rid of M -3 is to multiply by F 3 : Only h contains a dimension of mass. 

The result has dimensions L T : 
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To clear these dimensions, multiply by c 2 . The independent dimensionless 
group proportional to F is therefore 



Fh 3 c 2 
0 k B T ) 4 ' 



(5.112) 



As the only independent dimensionless group, it is a constant, so 

F (h D 4 

h 3 c 2 ' 

Including the dimensionless prefactor hidden in the twiddle (~), 
2 V ^ 

-p B y4 

60 h 3 c 2 



(5.113) 



(5.114) 



This result is the Stefan-Boltzmann law, and all the constants are lumped 
into cr, the Stefan-Boltzmann constant: 

r2 



cr = 



_J4_ 

60 h 3 c 2 



n 



(5.115) 



In Section 4.2.1, we used the scaling exponent in the Stefan-Boltzmann law 
to estimate the surface temperature of Pluto: We started from the surface 
temperature of the Earth and used proportional reasoning. Now that we 
know the full Stefan-Boltzmann law, we can directly calculate a surface 
temperature. In Problem 5.43, you will estimate the surface temperature of 
the Earth (and you then try to explain the life-saving discrepancy between 
prediction and reality). Here, we'll estimate the surface temperature of the 
Sun using the Stefan-Boltzmann law, the solar flux at the top of the Earth's 
atmosphere, and proportional reasoning. 



Let's work backward from our goal toward what we know. We would like 
to find the Sun's surface temperature T Sun . If we knew the energy flux F Sun 
at the Sun's surface, then the Stefan-Boltzmann law would give us the tem- 
perature: 



T’sun — 




(5.116) 



We can find F Sun from the solar flux F at the Earth's orbit by using propor- 
tional reasoning. Flux, as we argued in Section 4.2.1, is inversely propor- 
tional to distance squared, because the same energy flow (a power) gets 
smeared over a surface area proportional to the square of the distance from 
the source. Therefore, 
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F 



Sun's surface 



F 



^Earth's orbit 
^Sun 




(5.117) 



where R Sun is the radius of the Sun. 

A related distance ratio is. 

Ds un _ 2R Sun 

/ 

^Earth's orbit ^Earth's orbit 



(5.118) 



where D Sun is the diameter of the Sun. This ratio is the angular diameter of 
the Sun, denoted 0 Sun . Therefore, 



^Earth's orbit 
^Sun 



2 

$Sun 



(5.119) 



From a home experiment measuring the angular diameter of the Moon, 
which has the same angular diameter as the Sun, or from the table of con- 
stants, the angular diameter (9 Sun is approximately 10 -2 . Therefore, the dis- 
tance ratio is approximately 200, and the flux ratio is approximately 200 2 or 
4 x 10 4 . 



Continuing to unwind the reasoning chain, we get the flux at the Sun's 
surface: 



^ Sun's surface ~ 4 x 10 4 x 1300 Wm 2 S3 5 x 10 7 W m 2 . (5.120) 

F 



This flux, from the Stefan-Boltzmann law, corresponds to a surface temper- 
ature of roughly 5400 K: 



[ Sun 



/ 5 x 10 7 Wm~ 2 V /4 

\ 6 x 10 -8 W m -2 K -4 / 



5400 K. 



(5.121) 



This prediction is quite close to the measured temperature of 5800 K. 

Problem 5.42 Redo the blackbody-radiation analysis using linear algebra 

Use linear algebra to find the exponents y, z, and iv that make the combination 
F x (kfiT) !, h z c w dimensionless. 

Problem 5.43 Blackbody temperature of the Earth 

Use the Stefan-Boltzmann law F = cT 4 to predict the surface temperature of the 
Earth. Your prediction should be somewhat colder than reality. How do you ex- 
plain the (life-saving) difference between prediction and reality? 

Problem 5.44 Lengths related by the fine-structure constant 

Arrange these important atomic-physics lengths in order of increasing size: 
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a. the classical electron radius (Problem 5.37) r 0 = ( {e 2 /4iie 0 )/m e c 2 (roughly the 
radius of an electron if its rest energy were entirely electrostatic energy). 

b. the Bohr radius a 0 from Section 5.5.1. 

c. the reduced wavelength A of a photon with energy 2E 0 , where A = X/2n (analo- 
gously to how ft = h/lnorf = u>/2n) and E 0 is the binding energy of hydrogen. 
(The energy 2 E 0 , also the absolute value of the potential energy in hydrogen, is 
called the Rydberg). 

d. the reduced Compton wavelength of the electron, defined as h/m e c. 

Place the lengths on a logarithmic scale, and label the gaps (the ratios of successive 

lengths) in terms of the fine-structure constant a. 



5.5.3 Molecular binding energies 



We studied hydrogen, which as an element scarcely exists on Earth, mainly 
to understand chemical bonds. Chemical bonds are formed by attractions 
between electrons and protons, so the hydrogen atom is the simplest chem- 
ical bond. The main defect of this model is that the electron-proton bond 
in a hydrogen atom is much shorter than in most bonds. A typical chemical 
bond is roughly 1.5 angstroms, three times larger than the Bohr radius. Be- 
cause electrostatic energy scales as E oc 1/r, a typical bond energy should 
be smaller than hydrogen's binding energy by a factor of 3. Hydrogen's 
binding energy is roughly 14 electron volts, so C bond is roughly 4 electron 
volts — in agreement with the bond-energy values tabulated in Section 2.1. 



Another important bond is the hydrogen bond . These intermolecular bonds, 
which hold water molecules together, are weaker than the intramolecular 
hydrogen-oxygen bonds within a water molecule. But hydrogen bonds de- 
termine important properties of the most important liquid on our planet. 
For example, the hydrogen-bond energy determines water's heat of vapor- 
ization — which determines much of our weather, including the average 
rainfall on the Earth (as we found in Section 3.4.3). 



To estimate the strength of hydrogen bonds, use the propor- 
tionality h + 



E 



electrostatic 



oc 



c h c h 

r 



(5.122) 



| hydrogen 
1 bond 

o~ 



A hydrogen bond is slightly longer than a typical bond. Instead x 

of the typical 1.5 angstroms, it is roughly 2 angstroms. The 
bond length is in the denominator, so the greater length reduces the bond 
energy by a factor of 4/3. 
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Furthermore, the charges involved, q 2 and q 2 , are smaller than the charges 
in a regular, intramolecular bond. A regular bond is between a full proton 
and a full electron of charge. A hydrogen bond, however, is between the 
excess charge on oxygen and the corresponding charge deficit on hydro- 
gen. The excess and deficit are significantly smaller than a full charge. On 
oxygen the excess maybe 0.5e (where e is the electron charge). On each hy- 
drogen, because of conservation, it would then be 0.25e. These reductions 
in charge contribute factors of 1/2 and 1/4 to the hydrogen-bond energy. 
The resulting energy is roughly 0.4 electron volts: 

4eVx ^ x ^ x 7 k 0.4 eV. (5.123) 

4 2 4 

This estimate is not bad considering the rough numbers that it used. Empir- 
ically, a typical hydrogen bond is 23 kilojoules per mole or about 0.25 elec- 
tron volts. Each water molecule forms close to four hydrogen bonds (two 
from the oxygen to foreign hydrogens, and one from each hydrogen to a 
foreign oxygen). Thus, each molecule gets credit for almost two hydrogen 
bonds — one-half of the per-molecule total in order to avoid counting each 
bond twice. Per water molecule, the result is 0.4 electron volts 

Because vaporizing water breaks the hydrogen bonds, but not the intramol- 
ecular bonds, the heat of vaporization of water should be approximately 
0.4 electron volts per molecule. In macroscopic units, it would be roughly 
40 kilojoules per mole — using the conversion factor from Section 3.2.1, that 
1 electron volt per molecule is roughly 100 kilojoules per mole. 

Because the molar mass of water is 1.8 x 10~ 2 kilograms, the heat of vapor- 
ization is also 2.2 megajoules per kilogram: 

40 kj 1 mol 2.2 x 10 6 J 

f X r— « : (5.124) 

mol 1.8xl0 _2 kg kg 

And it is (p. xvii). We used this value in Section 3.4.3 to estimate the average 
worldwide rainfall. The amount of rain, and so the rate at which many 
plants grow, is determined by the strength of hydrogen bonds. 

Problem 5.45 Rotational energies 

The quantum constant h is also the smallest possible angular momentum of a ro- 
tating system. Use that information to estimate the rotational energy of a small 
molecule such as water (in electron volts). To what electromagnetic wavelength 
does this energy correspond, and where in the electromagnetic spectrum does it 
lie (for example, radio waves, ultraviolet, gamma rays)? 
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5.5.4 Stiffness and sound speeds 

An important macroscopic consequence of the per-atom and per-molecule 
energies is the existence of solid matter: substances that resist bending, 
twisting, squeezing, and stretching. These resistance properties are anal- 
ogous to spring constants. 

However, a spring constant is not the quantity to estimate first, because it is 
not invariant under simple changes. For example, a thick bar resists stretch- 
ing more than a thin bar does. Similarly, a shorter bar resists stretching 
more than a longer bar does. The property independent of the shape or 
amount of substance — invariant to these changes — is the stiffness or elastic 
modulus. There are several elastic moduli, of which the Young's modulus 
is the most broadly useful. It is defined by 
stress cr applied to the substance 

Y = — . ( 5 . 125 ) 

fractional change in its length [(A/) /I] 



The ratio in the denominator occurs so often that it has its own name and 
symbol: the strain e. 



Stress, like the closely related quantity pressure, l 

is force per area: It is the applied force F divided — 
by the cross-sectional area. The denominator, the 

fractional change in length, is the dimensionless ratio (A/)//. Thus, the di- 
mensions of stiffness are the dimensions of pressure — which are also the 
dimensions of energy density. To see the connection between pressure and 
energy density, multiply the definition of pressure (force per area) by length 
over length: 



force length energy 

pressure = x — = — . 

area length volume 



( 5 . 126 ) 



As energy per volume, we can estimate a typical elastic modulus Y. For the 
numerator, a suitable energy is the typical binding energy per atom — the 
energy Finding required to remove the atom from the substance by breaking 
its bonds to surrounding atoms. For the denominator, a suitable volume 
is a typical atomic volume fl 3 , where a is a typical interatomic spacing (3 
angstroms). The result is 



Y 



E 



binding 



a 



3 



( 5 . 127 ) 



This derivation is slightly incomplete: In multiplying the definition of pres- 
sure by length over length, we knew only that the two lengths have the 
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same dimensions, but not whether they have comparable values. If they do 
not, then the estimate for Y needs a dimensionless prefactor, which might 
be far from 1. Therefore, before we estimate Y, let's confirm the estimate by 
using a second method, an analogy — a form of abstraction that we learned 
in Section 2.4. 

The analogy, with which we also began this discussion of stiffness, is be- 
tween a spring constant ( k ) and a Young's modulus (Y). There are three 
physical quantities in the analogy. 

1. Stiffness. For a single spring, we measure stiffness using A:. For a material, 
we use the Young's modulus Y. Therefore, k and Y are analogous. 

2. Extension. For a single spring, we measure extension using the absolute 
length change Ax. For a material, we use the fractional length change 
(A l)/l (the strain c). Therefore, Ax and (A l) /l are analogous. 

3. Energy. For a single spring, we measure energy using the energy E di- 
rectly. For a material, we use an intensive quantity, the energy density 
£ = E/V. Therefore, E and f are analogous. 

Because the energy in a spring is 

E ~ k( Ax) 2 , (5.128) 



by analogy the energy density in a material will be 



£ 




(5.129) 



Let's estimate Y by setting AZ — Z. Physically, this choice means stretching 
or compressing each bond by its own length. This harsh treatment is, as a 
reasonable assumption, sufficient to break the bond and release the binding 
energy. Then the right side becomes simply the Young's modulus Y. 



The left side, the energy density, becomes simply the bond energy per vol- 
ume. For the volume, if we use the volume of one atom, roughly a 3 , then 
the bond energy contained in this volume is the binding energy of one atom. 
Then the Young's modulus becomes 



Y 



c 

^binding 




(5.130) 



This result is what we predicted based on the dimensions of pressure. How- 
ever, we now have a physical model of the Young's modulus, based on an 
analogy between it and spring constant. 
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Having arrived at this conclusion from dimensions and from an analogy, 
let's apply it to estimate a typical Young's modulus. To estimate the numer- 
ator, the binding energy, start with the typical bond energy of 4 electron 
volts per bond. With each atom is connected to, say, five or six other atoms, 
the total energy is roughly 20 electron volts. Because bonds are connec- 
tions between pairs of atoms, this 20 electron volts counts each bond twice. 
Therefore, a typical binding energy is 10 electron volts per atom. 



Using a ~ 3 angstroms, a typical stiffness or Young's modulus is 



Y 



-'binding 



10 eV 



(3x10 10 m) 3 



1.6 x 10 _iy J 1 
eV ~ 2 



x 10 11 J 



m -3 . (5.131) 



For very rough estimates, a convenient value to remember 
is 10 11 joules per cubic meter. Because energy density and 
pressure have the same dimensions, this energy density is 
also 10 11 pascals or 10 6 atmospheres. (Because atmospheric 
pressure is only 1 atmosphere, a factor of 1 million smaller, 
it has little effect on solids.) 

Using a typical stiffness, we can estimate sound speeds in 
solids. The speed depends not only on the stiffness, but also 
on the density: denser materials respond more slowly to the 
forces due to the stiffness, so sound travels more slowly in 
denser substances. From stiffness Y and density p, the only 
dimensionally correct way to make a speed is ^Y / p . If this 
speed is the speed of sound, then 



Y (10 11 Pa) 



diamond 


12 


steel 


2 


Cu 


1.2 


AI 


0.7 


glass 


0.6 


granite 


0.3 


Pb 


0.18 


concrete 


0.17 


oak 


0.1 


ice 


0.1 




(5.132) 



Based on a typical Young's modulus of 0.5 x 10 11 pascals and a typical den- 
sity of 2.5 x 10 3 kilograms per cubic meter (for example, rock), a typical 
speed of sound is 5 kilometers per second: 



0.5 x 10 11 Pa 
2.5 x 10 3 kg m - 



5kms 



(5.133) 



This prediction is reasonable for most solids and is exactly correct for steel! 
With this estimate, we end our dimensional-analysis tour of the properties 
of materials. Look what dimensional analysis, combined with analogy and 
proportional reasoning, has given us: the size of atoms, the energies of 
chemical bonds, the stiffness of materials, and the speed of sound. 
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5.6 Summary and further problems 

A quantity with dimensions has no meaning by itself. As Socrates might 
have put it, the uncompared quantity is not worth knowing. Using this 
principle, we learned to rewrite relations in dimensionless form: in terms of 
combinations of the quantities where the combination has no dimensions. 
Because the space of dimensionless relations is much smaller than the space 
of all possible relations, this rewriting simplifies many problems. Like the 
other two tools in Part II, dimensional analysis discards complexity without 
loss of information. 



Problem 5.46 Oblateness of the Earth 

Because the Earth spins on its axis, it is an oblate 
sphere. You can estimate the oblateness using di- 
mensional analysis and some guessing. Our mea- 
sure of oblateness will be AR = R eq — Rp 0 i ar (the 
difference between the polar and equatorial radii). 

Find two independent dimensionless groups built 
from AR, g, R (the Earth's mean radius), and v (the 
Earth's rotation speed at the equator). Guess a rea- 
sonable relation between them in order to estimate AR. Then compare your esti- 
mate to the actual value, which is approximately 21.4 kilometers. 

Problem 5.47 Eluge waves on deep water 

One of the highest measured ocean waves was encountered in 1933 by a US Navy 
oiler, the USS Ramapo (a 147-meter-long ship) [34]. The wave period was 14.8 sec- 
onds. Find its wavelength using the results of Problem 5.11. Would a wave of this 
wavelength be dangerous to the ship? 

Problem 5.48 Ice skating 

For world-record ice skating, estimate the power consumed by sliding friction. (Ice 
skates sliding on ice have a coefficient of sliding friction around 0.005.) Then make 
that power meaningful by estimating the ratio 

power consumed in sliding friction 

-7 -T : ^7- 7 • (5-134) 

power consumed by air resistance 

Problem 5.49 Pressure melting during ice skating 

Water expands when it freezes. Thus, increasing the pressure on ice should, by Le 
Chatelier's principle, push it toward becoming water — which lowers its freezing 
point. Based on the freezing point and the heat of vaporization of water, estimate 
the change in freezing point caused by ice-skate blades. Is this change large enough 
to explain why ice-skate blades slip with very low friction on a thin sheet of water? 
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Problem 5.50 Contact radius 

A solid ball of radius R, density p, and elastic modulus 
Y rests on the ground. Using dimensional analysis, how 
much can you deduce about the contact radius r? 

Problem 5.51 Contact time 

The ball of Problem 5.50 is dropped from a height, hits a 
hard table with speed v, and bounces off. Using dimen- 
sional analysis, how much can you deduce about the con- 
tact time? 

Problem 5.52 Floating on water 

Some insects can float on water thanks to the surface tension of water 
the bug size l (a length), find the scaling exponents a and /3 in 

F 7 °c /*, 

W oc if, 

where F ? is the surface-tension force and W is the bug's weight. (Surface tension 
itself has dimensions of force per length.) Thereby explain why a small-enough 
bug can float on water. 

Problem 5.53 Dimensionless measures of damping 

A damped spring-mass system has three parameters: the spring constant k, the 
mass m, and the damping constant 7 . The damping constant determines the damp- 
ing force through F ? = —yv, where v is the velocity of the mass. 

a. Use these quantities to make the dimensionless group proportional to 7 . Me- 
chanical and structural engineers use this group to the define the dimensionless 
damping ratio £ : 

1 

£ = - x the dimensionless group proportional to 7 . (5.136) 

b. Find the dimensionless group proportional to 7 -1 . Physicists and electrical 
engineers, following conventions from the early days of radio, call this group 
the quality factor Q. 

Problem 5.54 Steel cable under its own weight 

The stiffness of a material should not be confused with its strength! Strength is the 
stress (a pressure) at which the substance breaks; it is denoted Like stiffness, 
it is an energy density. The dimensionless ratio (Tyler , called the yield strain e„, 
has a physical interpretation: the fractional length change at which the substance 
breaks. For most materials, it lies in the range 10 - 3 ...10 -2 — with brittle materials 
(such as rock) toward the lower end. Using the preceding information, estimate 
the maximum length of a steel cable before it breaks under its own weight. 



. In terms of 



(5.135) 
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Problem 5.55 Orbital dynamics 

A planet orbits the Sun in an ellipse that can be described 
by the distance of closest approach r min and by the fur- 
thest distance r max . The length l is their harmonic mean: 

; = 2 Wmax _ 2 (r min || r max ). ( 5 . 137 ) 

'min ' 'max 

(You will meet the harmonic mean again in Problem 8.22, 
as an example of a more general kind of mean.) 





The table gives r min and r max for the planets, as well as the specific effective poten- 
tial V , which is the effective potential energy divided by the planet mass m (the 
effective potential itself mixes the gravitational potential energy with one compo- 
nent of the kinetic energy). The purpose of this problem is to see how universal 
functions organize this seemingly messy data set. 





hnin(m) 


^max ( m ) 


V (m 2 s 2 ) 


Mercury 


4.6001 xlO 10 


6.9818 xlO 10 


-1.1462 xlO 9 


Venus 


1.0748 x 10 11 


1.0894 xlO 11 


-6.1339 xlO 8 


Earth 


1.4710 xlO 11 


1.5210 xlO 11 


-4.4369 x 10 8 


Mars 


2.0666 xlO 11 


2.4923 x 10 11 


-2.9119 xlO 8 


Jupiter 


7.4067 x 10 11 


8.1601 x 10 11 


— 8.5277 x 10 7 


Saturn 


1.3498 xlO 12 


1.5036 xlO 12 


-4.6523 x 10 7 


Uranus 


2.7350 x 10 12 


3.0063 x 10 12 


-2.3122 xlO 7 


Neptune 


4.4598 x 10 12 


4.5370 x 10 12 


-1.4755 xlO 7 



a. On a graph of V versus r , plot all the data. Each planet provides two data 
points, one for r = r min and one for r = r max . The plot should be a mess. But 
you'll straighten it out in the rest of the problem. 

b. Now write the relation between V and r in dimensionless form. The relevant 
quantities are V , r, GMg un , and the length /. Choose your groups so that V 
appears only in one group and r appears only in a separate group. 

c. Now use the dimensionless form to replot the data in dimensionless form. All 
the points should lie on one curve. You have found the universal function char- 
acterizing all planetary orbits! 

Problem 5.56 Signal propagation speed in coaxial cable 

For the coaxial cable of Problem 2.25, estimate the signal propagation speed. 

Problem 5.57 Meter stick under pressure 

Estimate how much shorter a steel meter stick becomes due to being placed at the 

bottom of the ocean. What about a meter stick made of wood? 
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Problem 5.58 Speed of sound in water 

Using the heat of vaporization of water as a measure of the energy density in its 
weakest bonds, estimate the speed of sound in water. 

Problem 5.59 Delta-function potential 

A simple potential used as a model to understand molecules is the one-dimen- 
sional delta-function potential V (x) = — E 0 LS(x ), where E g is an energy and L is a 
length (imagine a deep potential of depth E 0 and small width L). Use dimensional 
analysis to estimate the ground-state energy. 

Problem 5.60 Tube flow 

In this problem you study fluid flow through a narrow 
tube. The quantity to predict is Q, the volume flow rate 
(volume per time). This rate depends on five quantities: 

/ the length of the tube 

A p the pressure difference between the tube ends 

r the radius of the tube 
p the density of the fluid 
v the kinematic viscosity of the fluid 

a. Find three independent dimensionless groups Gi, G 2 , and G 3 from these six 
quantities — preparing to write the most general statement as 

group 1 = /(group 2, group 3). ( 5 . 138 ) 

Hint: One physically reasonable group is G 2 = r/l; to make solving for Q pos- 
sible, put Q only in group 1 and make this group proportional to Q. 

b. Now imagine that the tube is long and thin (l » r) and that the radius or flow 
speed is small enough to make the Reynolds number low. Then deduce the 
form of / using proportional reasoning: First find the scaling exponent /3 in 
Q oc (A p)f-, then find the scaling exponent 7 in Q oc P. Hint: If you double A p 
and l, what should happen to Q? 

Determine the form of / that satisfies all your proportionality requirements. 
If you get stuck, work backward from the correct result. Look up Poiseuille 
flow, and use the result to deduce the preceding proportionalities; and then 
give reasons for why they are that way. 

c. The analysis in the preceding parts does not give you the universal, dimension- 
less constant. Use a syringe and needle to estimate the constant. Compare your 
estimate with the value that comes from solving the equations of fluid mechan- 
ics honestly (by looking up this value). 
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Problem 5.61 Boiling versus boiling away 

Look up the specific heat of water in the table of constants (p. xvii) and estimate 
the ratio 

energy to bring a pot of water to boiling temperature 
energy to boil away the boiling water 



Problem 5.62 Kepler's law for elliptical orbits 

Kepler's third law connects the orbital period to the minimum and maximum or- 
bital radii r min and r max and to the gravitational strength of the Sun: 



T = 



2 n 



a 3 ' 2 

-yj GMg un 



(5.140) 



where the semimajor axis a is defined as a = (r min + r max ) /2. Write Kepler's third 
law in dimensionless form, making one independent dimensionless group propor- 
tional to T and the other group proportional to r min . 



Problem 5.63 Why Mars? 

Why did Kepler need data about Mars's orbit to conclude that planets orbit the 
Sun in an ellipse rather than a circle? Hint: See the data in Problem 5.55. 



Problem 5.64 Froude number for a ship's hull speed 

For a ship, the hull speed is defined as 

v = 1.34VT, (5.141) 

where v is measured in knots (nautical miles per hour), and /, the waterline length, 
is measured in feet. The waterline length is, as you might expect, the length of the 
boat measured at the waterline. The hull speed is a boat's maximum speed before 
the water drag becomes very large. 

Convert this unit-specific formula to an approximate Froude number Fr, the di- 
mensionless number introduced in Section 5.1.1 to estimate the maximum walking 
speed. For the hull speed, the Froude number is defined as 

v 2 

Fr = — . (5.142) 

gl 

From the approximate Froude number, guess the exact value! 





Part III 



Discarding complexity 
with loss of information 



You've organized (Part I); you've discarded complexity without losing infor- 
mation (Part II); yet the phenomenon still resists understanding. When the 
going gets tough, the tough lower their standards: Approximate first, and 
worry later. Otherwise you never start, and you can never learn that the 
approximations would have been accurate enough — if only you had gath- 
ered the courage to make them. Helping you make these approximations 
is purpose of our final set of tools. 

These four tools help us discard complexity while losing information. First, 
we round or lump complicated numbers and graphs (Chapter 6). Second, 
we accept that our knowledge is incomplete, and we quantify the uncer- 
tainty with the tool of probability (Chapter 7). Third, we study simpler 
versions of hard problems — the tool of easy cases (Chapter 8). Fourth and 
finally, by making spring models (Chapter 9), we approximate and can un- 
derstand many phenomena, including cooking times, sound speeds, and 
the color of the sky and the sunset. 
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In 1982, thousands of students in the United States 
had to estimate 3.04x5.3, choosing 1.6, 16, 160, 1600, or 
"I don't know." Only 21 percent of 13-year-olds and 37 
percent of 17-year-olds chose 16. As Carpenter and 
colleagues describe [7], the problem is not a lack of 
calculation skill. On questions testing exact multipli- 
cation ("multiply 2.07 by 9.3"), the 13-year-olds scored 
57 percent, and the 17-year-olds scored 72 percent cor- 
rect. The problem is a lack of understanding; if you earn roughly $5 per 
hour for roughly 3 hours, your net worth cannot grow by $1600. The stu- 
dents needed our next tool: rounding or, more generally, lumping. 



choice 


age 13 


age 17 


1.6 


28% 


21% 


16 


21 


37 


160 


18 


17 


1600 


23 


11 


don't know 


9 


12 



6.1 Approximate! 

Fortunately, rounding is inherent in our perception of quantity: Beyond 
three items, our perception of "how many" comes with an inherent impre- 
cision. This fuzziness is, for adults, 20 percent: If we briefly see two groups 
of dots whose totals are within 20 percent, we cannot easily judge which 
group has more dots. Try it by glancing at the following squares. 
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In the left pair, one square contains 10 percent more dots than the other 
square; in the right pair, one square contains 30 percent more dots than the 
other square. In the 10-percent pair, spotting the more numerous square 
is difficult. In the 30-percent pair, it is almost obvious at sight. Lump- 
ing comes naturally; we just need the courage to do it. We'll develop the 
courage first in rounding numbers, the most familiar kind of lumping. 




6.2 Rounding on a logarithmic scale 

Just as driving to visit a next-door neighbor atrophies our muscles and abil- 
ity to move around the physical world, asking calculators to do simple arith- 
metic dulls our ability to navigate the quantitative world. We never develop 
an innate sense of sizes and scales in the world. The antidote is to do the 
computations ourselves, but approximately — by placing quantities on a log- 
arithmic scale and rounding them to the nearest convenient value. 

6.2.1 Rounding to the nearest power of ten 

The simplest method of rounding is to round every number to the nearest 
power of ten. That simplification turns most calculations into adding and 
subtracting integer exponents (the exceptions come from roots, which pro- 
duce fractional exponents). Here, "nearest" is judged on a logarithmic scale, 
where distance is measured not with differences but with ratios or factors. 
For example, 50 — although closer to 10 than to 100 on a linear scale — is a 
factor of 5 greater than 10 but only a factor of 2 smaller than 100. Therefore, 
50 is closer to 100 than to 10 and would get rounded to 100. 

As practice, let's estimate the number of minutes in a day: 

, , 24 hours 60 minutes 

1 day x x — = 24 x 60 mmutes. (6.1) 

day hour 

Now we round each factor to the nearest power of 10. Because 24 is only 
a factor of 2.4 away from 10, but more than a factor of 4 away from 100, it 
gets rounded to 10: 
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2.4 4.2 




24 

In contrast, 60 is closer to 100 than to 10: 




With these approximations, 1 day is approximately 1000 minutes: 

from 24 from 60 

- , 10 hours 100 minutes - „„„ 

1 day x x = 1000 minutes. 



day 



hour 



( 6 . 2 ) 



The exact value is 1440 minutes, so the estimate is only 30 percent too small. 
This error is a reasonable tradeoff to gain a method that requires almost no 
effort — who needs a calculator to multiply 10 by 100? Furthermore, the 
accuracy is enough for many calculations, where insight is needed more 
than accuracy. 

Problem 6.1 Rounding to the nearest power of ten 

Round these numbers to the nearest power of ten: 200, 0.53, 0.03, and 7.9. 

Problem 6.2 Boundary between rounding up or down 

We saw that 60 rounded up to 100 but that 24 rounded down to 10. What number 
is just at the boundary between rounding down to 10 and rounding up to 100? 

Problem 6.3 Using rounding to the nearest power of ten 

Round the numbers to the nearest power of ten and thereby estimate the products: 

(a) 27 x 50, (b) 432 x 12, and (c) 3.04 x 5.3. 

Problem 6.4 Calculating the bending of light 

In Section 5.3.1, we used dimensional analysis to show that the Earth could bend 
starlight by approximately the angle 

G ^ Earth 

6.7 x 10 -11 kg -1 m 3 s -2 x 6xl0 24 kg 

0 ~ 2 Tv — t t • (6.3) 

6.4xl0 6 m x 10 17 m 2 s~ 2 

^Earth C 2 

Round each factor to the nearest power of ten in order to estimate the bending 
angle mentally. How long did making the estimate take? 
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6.2.2 Rounding to the nearest half power of ten 

Rounding to the nearest power of ten gives a quick, preliminary estimate. 
When it is too approximate, we just round more precisely. The next increase 
in accuracy is to round to the nearest half power of ten. 

As an illustration, let's estimate the number of seconds in a year. The nu- 
merical calculation (without the units) is 365 x 24 x 3600. Now we round 
each factor by replacing it with a number of the form 10 ^ . In the previous 
method, where we rounded to the nearest power of ten, f> was an integer. 
Now f> can also be a half-integer (for example, 2.5). 

What is a half power often numerically? 

Two half powers of ten multiply to make 10, so each half power is flO, or 
slightly more than 3 (as you found in Problem 6.2). When you need more 
precision, a half power of ten is 3.2 or 3.16, although that much precision is 
rarely needed. 

In rounding the calculation for the number of seconds, 365 becomes 10 25 , 
and, as diagrammed below, 3600 becomes 10 35 : 



1.1 2.8 




24 



Thus, we replace 24 by 10 15 . Then the calculation simplifies to 

10 2 - 5 x 10 15 x 10 3 - 5 . (6.4) 

~~365~" 24 ~360cT 

Now just add the powers of ten: 

10 2 - 5 x 10 15 x 10 3 - 5 = 10 7 - 5 . (6.5) 

Because the 0.5 in the exponent 7.5 contributes a factor of 3, there are about 
3 x 10 7 seconds in a year. I remember this value as n x 10 7 , which is accurate 
to 0.5 percent. 
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Problem 6.5 Earth's orbital speed 

Using the estimate of n x 10 7 seconds in a year, estimate the Earth's orbital speed 
around the Sun. Don't use a calculator! (The Earth-Sun distance, 1.5 x 10 11 meters, 
is worth memorizing.) 

Problem 6.6 Where does the n come from? 

True or false: The n in n x 10 7 seconds per year arises because the Earth orbits the 
Sun in a circle, and a circle has n in its circumference. 

Problem 6.7 Only approximately pi 

True or false: The n in n x 10 7 seconds per year is not exact because the Earth orbits 
in a slightly noncircular ellipse. 

Problem 6.8 Estimating geometric means 

In Section 2.3, 1 talked to my gut to estimate the US annual oil imports. That dis- 
cussion resulted in the geometric-mean estimate 

V10 million x 1 trillion barrels per year. (6.6) 

Estimate the square root by placing the two quantities 10 million and 1 trillion on 
a logarithmic scale and finding their midpoint. 



6.3 Typical or characteristic values 

Lumping not only simplifies numbers, where it is called rounding, it also 
simplifies complex quantities by creating an abstraction: the typical or char- 
acteristic value. We've used this form of lumping implicitly many times; 
now it's time to discuss it explicitly, in order to appreciate its scope and to 
develop skill in applying it. 

6.3.1 Estimating the population of the United States 

Our first explicit example of a typical or characteristic value occurs in the fol- 
lowing population estimate. Knowing a population is essential in many es- 
timates about the social world, such as a country's oil imports (Section 1.4), 
energy consumption, or per-capita land area (Problem 1.14). Here we'll es- 
timate the population of the United States by dividing it into two factors. 

US population ~ N states x population of a typical US state. (6.7) 

The first factor, N states , everyone in America learns in school: N states = 50. 
The second factor contains the lumping approximation. Rather than using 
all 50 different state populations, we replace each state with a typical state. 
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What is the population of a typical state? 

A large state is California or New York, each of which has a megacity with a 
population in the several millions. A tiny state is Delaware or Rhode Island. 
In between is Massachusetts. Because I live there, I know that it has 6 mil- 
lion people. Taking Massachusetts as a typical state, the US population is 
roughly 50 x 6 million or 300 million. This estimate is more accurate than 
we deserve: The 2012 population is 314 million. 

From this example, you can also see how lumping enhances symmetry rea- 
soning: When there is change, look for what does not change (Section 3.1). 
Here, each state has its own population, so there's plenty of change among 
the list of states. Lumping helps us find, or create, a quantity that does not 
change. We imagine a typical state, one that may not even exist (just as no 
family has the average of 2.3 children), and replace every state with this 
state. We've lumped away all the change — throwing away information in 
exchange for insight into the population of the country. 

Problem 6.9 German federal states 

The Federal Republic of Germany has 16 federal states. Pick one at random, multi- 
ply its population by 16, and compare that estimate with Germany's population. 



6.3.2 Lumping varying physical quantities: How high can animals jump? 

Using typical or characteristic values allows us to reason out seemingly im- 
possible questions while sitting in our armchairs. For practice, we'll study 
how the jump height of an animal depends on its size. For example, should 
a person be able to jump higher than a locust? 

The jump could be a running or a standing high jump. Both types are inter- 
esting, but the standing high jump teaches more about lumping. Therefore, 
imagine that the animal starts from rest and jumps directly upward. 

Even with this assumption, the problem looks underspecified. The jump 
height depends at least on the animal's shape, on how much muscle the 
animal has, and on the muscle efficiency. It is the kind of problem where 
lumping, a tool for making assumptions, is most helpful. We'll use lumping 
and proportional reasoning to find the scaling exponent fin h oc nf’, where 
h is the jump height and m is the animal's mass (our measure of its size). 

Finding a scaling exponent usually requires a physical model. You can of- 
ten build them by imagining an extreme, unrealistic situation, and then 
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asking yourself what physics prevents it from happening. Thus, why can't 
we jump to the Moon? Because it demands a vast amount of energy, far be- 
yond what our muscles can supply. The point to extract from this thought 
experiment is that jumping demands energy, which is supplied by muscles. 



The appearance of supply and demand suggests, as in the estimate of the 
number of taxis in Boston (Section 3.4.1), that we equate the demand to the 
supply. Then we estimate each piece separately — divide and conquer. 



The energy demanded is the gravitational potential energy mgh. Here, 
g is the gravitational acceleration, and the jump height h is measured 
as the vertical change in the animal's center of mass (CM). Because 
all animals feel the same gravity, E demanded = m gh simplifies to the 
proportionality E demanded « mh. 

For the supplied energy, we again divide and conquer: 

^supplied ~ muscle mass x muscle energy density, (6.8) 



animal 



CM 



animal 



CM 



where the muscle energy density is the energy per mass that muscle can 
supply. This product already contains a lumping assumption: that all the 
muscles in an animal provide the same energy density. This assumption is 
reflected in the single approximation sign (~). 

Even so, the result is not simple enough. Each species, and each individ- 
ual within a species, will have its own muscle mass and energy density. 
Therefore, let's make the further lumping assumption that all animals, even 
though they vary in muscle mass, share the same muscle energy density. 
The assumption is plausible because all muscles use similar biological tech- 
nology (actin and myosin filaments). Fortunately, making this assumption 
is an approximation: Lumping throws away actual information, which is 
how it reduces complexity. With the assumption of a common energy den- 
sity, the supplied energy becomes the simpler proportionality 

^supplied ^ muscle mass. (6.9) 



The simplicity of this result reminds us that an approximate result can be 
more useful than an exact result. 



The muscle mass also varies from animal to animal. Introducing a dimen- 
sionless prefactor and making another lumping approximation will man- 
age that complexity. The dimensionless prefactor cc will be the fraction of 
an animal's mass that is muscle: 



^muscle orrn. 



( 6 . 10 ) 
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Alas, a varies across species (compare a cheetah and a turtle), within a 
species, and within the lifetime of an individual — for example, my a is 
dropping as I sit writing this book. If we account for all these variations, 
we will be overwhelmed by their complexity Lumping rescues us: It gives 
us permission to assume that a is the same for every animal. We replace the 
diversity of animals with a typical animal. This assumption is not as crazy 
as it might sound. It doesn't mean that all animals have the same muscle 
mass. Rather, it means that all animals have the same fraction of muscle; as 
an example, for people, cc ~ 0.4. 

With this lumping assumption, m muscle = am becomes the simpler propor- 
tionality m musc | e cx m. Because the supplied energy is proportional to the 
muscle mass, it is proportional to the animal's mass: 

^supplied “ ^muscle m - f 6 - 11 ) 

This result is as simple as we can hope for, and it depends on the right 
quantity, the animal's mass. Now let's use it to predict how an animal's 
jump height depend on this mass. 

Because the demanded energy and supplied energy are equal, and the de- 
manded energy — the gravitational potential energy — is proportional to mil, 

mil oc m. (6.12) 

The common factor of m 1 cancels out, leaving h independent of m: 

h oc m°. (6.13) 

All (jumping) animals should be able to jump to the same height! 

The result always surprises me. My intuition, before doing the calculation, 
cannot decide how h should behave. On the one hand, small animals seem 
strong: Ants can lift a mass many times their own body mass, whereas 
humans can lift only roughly their own body mass. However, my intuition 
also insists that people should be able to jump higher than locusts. 



The data, from Scaling: Why Animal Size Is So Important 
[41, p. 178], contradict my intuition and confirm our 
lumping and scaling analysis. The mass spans more 
than eight orders of magnitude (eight decades, or a 
factor of 10 s ), yet the jump height, as a change in the 
height of the center of mass, varies only by a factor of 3 
(half a decade). On a log-log graph, the heigh t-ver- 





m 


h 


flea 


0.5 mg 


20 cm 


click beetle 


40 mg 


30 cm 


locust 


3g 


59 cm 


human 


70 kg 


60 cm 



sus-mass line would have a slope less than 1/16. Our predicted scaling 
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of constant jump height ( h ex m°), which corresponds to a slope of zero, is 
surprisingly accurate. The outliers, fleas and click beetles, are at the lighter 
end. (For an explanation, try Problem 6.10.) Furthermore, at the heavier 
end, locusts and humans, although differing by more than four orders of 
magnitude in mass, jump to almost exactly the same height. 

A moral of this example is that lumping augments proportional reasoning. 
Proportional reasoning reduces complexity by showing us a notation for 
ignoring quantities that do not vary. For example, when all animals face the 
same gravitational field, then E demanded = simplifies to E demanded oc mil. 
Alas, we live in the desert of the real, where "the same" is almost always 
only an approximation — for example, as with the energy density of muscle 
in different animals. Lumping rescues us. It gives us permission to replace 
these changing values with a single, constant, typical value — making the 
relations amenable to proportional reasoning. 

Problem 6.10 Jumping fleas 

The prediction of constant jump height seems to fail at small sizes: Larger animals 
jump about 60 centimeters, whereas fleas reach only 20 centimeters. In this prob- 
lem, you evaluate whether air drag can explain the discrepancy. 

a. As a lumping approximation, pretend that an animal is a cube with side length 

l, and assume that it jumps to a height h independent of its mass m. Then 
find the scaling exponent /3 in £ drag /£ demanded where £ drag is the energy 

consumed by drag and £ demanded is the energy needed in the absence of drag. 

b. Estimate £ drag /£ d emanded f° r a cubical human that can jump to 60 centimeters. 
Using the scaling relation, estimate it for a cubical flea. What is your judgment 
of drag as an explanation for the lower jump heights of fleas? 



6.3.3 Period of an ideal spring 



A surprising conclusion of dimensional analysis is that the 
period of a spring, or a small-amplitude pendulum, does 
not depend on its amplitude (Problem 5.14). However, the 
mathematical reasoning doesn't give us the why; it doesn't provide a phys- 
ical insight. That insight comes from lumping, by using characteristic val- 
ues. We'll try it together by finding the period of a spring; you'll practice 
by finding the period of a pendulum (Problem 6.11). 



This kind of lumping, in exchange for providing physical insight, requires 
that we make a physical model. Here, a stretched or compressed spring 
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exerts a force on and thereby accelerates the mass. If the spring is stretched 
to an amplitude x 0 , then the force is kx 0 and the acceleration is kx 0 /m. This 
acceleration varies as the mass moves, so analyzing the motion usually re- 
quires differential equations. However, this acceleration is also a character- 
istic acceleration. It sets the scale for the acceleration at all other times. If we 
replace the changing acceleration with this characteristic acceleration, the 
complexity vanishes. The problem becomes a constant-acceleration prob- 
lem where a ~ kx 0 /m. 

With a constant acceleration a for a time comparable to one period T, the 
mass moves a distance comparable to aT 2 , which is kx 0 T 2 /m. When apply- 
ing lumping and characteristic values, "comparable" is the verbal transla- 
tion of the single approximation sign As an equation, we would write 

distance ~ aT 2 . (6.14) 



Another useful translation is "of the order of": The distance is of the order 
of aT 2 . Equivalently, the characteristic distance is aT 2 . 



This characteristic distance must be comparable to the amplitude x 0 . Thus, 
kx 0 T 2 

x o • ( 6 - 15 ) 



m 



The amplitude x 0 cancels out! Then the period is T ~ ^rn/k . Lumping 
thus provides the following explanation for why the period is independent 
of amplitude: As the amplitude and therefore the travel distance increase, 
the force and acceleration increase just enough to compensate, leaving the 
period unchanged. 



Problem 6.11 Period of a small-amplitude pendulum using lumping 

Use characteristic values to explain why the period of a (small-amplitude) pendu- 
lum is independent of the amplitude 0 O . 

Problem 6.12 Period of a nonlinear spring 

Imagine a nonlinear spring with force law F oc x n . Use lumping as follows to find 
how the period T varies with amplitude x 0 . 

a. Estimate a typical or characteristic acceleration. 

b. At this acceleration, roughly how far does the mass travel in one period T? 

c. This distance must be comparable to the amplitude x 0 . Therefore, find the scal- 
ing exponent a in T oc Xg (where a will be a function of the scaling exponent n 
in the force law). Then check your answer to Problem 5.17. 
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6.3.4 Lumping derivatives 

The preceding analysis of the period of a spring-mass system (Section 6.3.3) 
illustrates a general simplification: Using characteristic values, we can re- 
place derivatives with algebra. The algebraic expression usually provides 
a physical model. As an example, let's explain, physically, an acceleration 
that we derived by dimensional analysis in Section 5.1.1: the inward accel- 
eration of an object moving in a circle. 

Acceleration is the derivative of velocity: a = dv/dt. Using the definition of 
derivative, 

dv _ infinitesimal change in v 

dt ~ (infinitesimal) time required to make this change in v 



Infinitesimal changes and times are difficult to picture, so an analysis based 
on calculus often does not help us see why a result is true. 

A lumping approximation, by discarding complexity, can give this insight. 
A way to remember the lumping approximation, first use 6 = 6 to cancel 
the 6s in 16/64: 



_ l 

A4 ~~ 4" 



( 6 . 17 ) 



The result is exact! Although this particular cancellation is dubious, it sug- 
gests the analogous lumping approximation d = d. The resulting cancella- 
tion turns derivatives into algebra: 



jdfv v 




What does v/t mean? 



Lumping replaces "infinitesimal" with "characteristic": 
y characteristic change in v 

t time required to make this change in v 

The numerator asks us to look at the changes in v and to represent them by 
a characteristic or typical change. The denominator is, as an abbreviation, 
often called the characteristic time, or time constant, and denoted r. 

In applying this approximation to circular motion, we have to distinguish 
the velocity vector v from its magnitude v (the speed). The speed, at least in 
constant-speed circular motion, never changes, so dv/dt itself is zero. We 
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are interested in \dv\/dt: the magnitude of the vector's derivative, rather 
than the derivative of the vector's magnitude. The lumped acceleration a is 

I characteristic change in v I 

a 1 - - - 1 . (6.20) 

time required to make this change m v 

The maximum change in v is reversing direction, from +v to — v. A charac- 
teristic change in v is v itself, or any value comparable to it. This range of 
possibilities is captured by the single approximation sign, which stands for 
"is comparable to." With that notation, 

characteristic change in v ~ v. (6.21) 



Here is an example of such a change, showing the velocity vectors before 
and after the particle has rotated partway around the circle. 




When v makes its significant change, from v old to v new , it produces a change 
in v comparable in magnitude to v. The characteristic time r — the time 
required to make this change — is a decent fraction of a period of revolution. 
Because a full period is 2nr/v, the characteristic time is comparable to r/v. 
Using r to represent the characteristic time, and ~ to hide dimensionless 
prefactors, we write r ~ r/v. 



Here's another way to arrive at r ~ r/v. If the triangle of v old , v new , and 
Av is an equilateral triangle, then it corresponds to the particle rotating 60° 
around the circle, which is almost exactly 1 radian. Because 1 radian cre- 
ates an arc of length r (the same proportionality tells us that 2n radians 
produces the circumference 2nr), the travel time is r/v, and r ~ r/v. Then 
the acceleration is roughly v 1 /r\ 

a — = — . ( 6 . 22 ) 



v_ 

T 



V 

r/v 



_ 

r 



This equation encapsulates a physical, proportional-reasoning explanation 
of a = v 2 / r. Namely, in circular motion, the velocity vector changes di- 
rection significantly in approximately 1 radian of rotation. This motion re- 
quires a time r ~ r/v. Therefore, the circular acceleration a contains two 
factors of v — one factor from the v itself and one factor from the time in the 
denominator — and it contains 1/r, also from the time in the denominator. 
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6.3.5 Simplifying with characteristic values: Yield of the first atomic blast 

In Section 5.2.2, we used dimensional analysis to estimate the 
yield of an atomic blast. We accurately predicted that the blast 
energy E is related to the blast radius R and the air density p: 

pR 5 

E ~ ( 6 . 23 ) 

where t is the time since the blast. However, dimensional analysis, as a 
mathematical argument, does not give a physical explanation for its results, 
which can often feel like magic. Lumping explains the magic by helping 
us analyze a physical model — a model whose analysis would otherwise be 
absurd in its complexity. 

The physical model is that the blast increases the thermal energy of the 
air molecules, thus increasing the speed of sound — which is the speed at 
which the blast expands. Using this model exactly requires setting up and 
solving differential equations, as Taylor did [44]. Lumping, by turning cal- 
culus into algebra, simplifies the equations without discarding their physi- 
cal meaning. 

The first step is to estimate the thermal energy. It comes almost entirely 
from the hot fireball — that is, from the blast energy. This energy is spread 
unevenly over the blast, with a higher energy density nearer the blast center 
and a lower density farther away. For the lumping approximation, smear 
the blast energy E evenly throughout a sphere of radius R. This sphere's 
volume is comparable to R 3 , so the typical or characteristic energy density 
is <5 ~ E/R 3 . 

The next step is to use this energy density to estimate the speed of sound c s . 
As we discussed in Section 5.4.1, the speed of sound is comparable to the 
thermal speed, so 




where k B T is the approximate thermal energy of one air molecule and m is 
the mass of one air molecule. 

To connect this speed to the energy density <£, let's convert energy per mol- 
ecule into energy per volume, by multiplying k B T by the number density n 
(the number of air molecules per volume). The result is nk B T, which is the 
thermal and therefore roughly the blast energy density <5 ~ E/R 3 . 
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To be fair, we also multiply the denominator m by n. This step converts 
mass per molecule into mass per volume, which gives the density p: 

mass molecules mass 

] X ] = — ] • (6.25) 

molecule volume volume 

m n p 

Therefore, the speed of sound is comparable to ^ E/pR 3 : 



nk R T 



nm 





(6.26) 



This speed is the rate at which the blast expands. 



► 



Based on this speed, hozv large should the blast be after time t ? 



Because the energy density and therefore the sound speed decreases as R 
increases, the blast radius is not simply the speed at time t multiplied by the 
time. But we can make a further lumping approximation: that the typical 
or characteristic speed, for the entire time t, is E/pR 3 . In evaluating this 
speed, we'll use the radius R at time t as the characteristic radius. 

blast radius ~ characteristic speed x time. (6.2 7) 

R c s ~^E/pR 3 t 

The solution for the blast energy E is E ~ pR 5 /t 2 , as we found using dimen- 
sional analysis. Lumping complements that mathematical reasoning with 
a physical model. 



6.4 Applying lumping to shapes 

Lumping by replacing varying quantities with their typical, or characteris- 
tic, values is close to the next form of lumping: lumping shapes. Our first 
illustration is explaining a curious fact about everyday materials. 



6.4.1 Densities of liquids and solids 

Among books teaching the art of approximation, a classic is Consider a Spher- 
ical Cow [22], so named because a sphere is a much simpler shape than a cow. 
An even simpler shape is a cube. Thus, a powerful form of lumping is to 
replace complex shapes by a comparably sized cube. With this idea, we can 
explain why most solids and liquids have densities between 1 and 10 times 
the density of water. 
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Each atom is a complex, ill-defined shape, but pretend that it 
is cube. Because the atoms touch, the density of the substance 
is approximately the density of one approximating cube: 

_ mass of the atom ^ 

^ volume of the lumped cube 

To evaluate the numerator, use A as the atom's atomic mass. 

Although A is called a mass, it is dimensionless: It is almost 
exactly the total number of protons and neutrons in the nucleus. Because 
protons and neutrons have almost the same mass, the proton mass m p can 
represent the mass of either constituent. Then the mass of one cube is Am p . 




To find the denominator, the cube's volume, make each cube's side be a typ- 
ical atomic diameter of a ~ 3 angstroms. This size is based on our calcula- 
tion in Section 5.5.1 of the diameter of the smallest atom, hydrogen (whose 
diameter was roughly 1 angstrom). Then the density becomes 



Ani p 
(3 A) 3 " 



(6.29) 



To avoid looking up the proton mass m p , let's multiply this fraction by 
Na/N a , where N A is Avogadro's number: 



A x m p N A 
(3 A ) 3 x N a 



(6.30) 



The numerator is A grams per mole, because 1 mole of protons (roughly, of 
hydrogen) has a mass of 1 gram. 



The denominator is roughly 18 cubic centimeters per mole: 

-.3 



3 x 10 23 cm 3 

(3 A) 3 



6x10“ moF 1 = 1*22! 

mol 



n 4 



(6.31) 



A typical solid or liquid density is then related simply to the substance's 
atomic mass A : 



Agmol 1 _ A_ g 

18 cm 3 mol -1 13 cm 3 



(6.32) 



Problem 6.13 Rounding to estimate atomic volume 

Use rounding to the nearest half power of ten (Section 6.2.2) to show that a cube 
with side length 3 Angstroms has a volume of approximately 3 x 10 -23 cubic cen- 
timeters. 
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Many common elements have atomic masses between 
18 and 180, so the densities of many liquids and solids 
should lie between 1 and 10 grams per cubic centimeter. 

As shown in the table, the prediction is reasonable. The 
table even includes a joker: water is not an element! Yet 
the density estimate is exact. 

The table also shows why centimeters and grams are, for 
materials physics, more convenient than are meters and 
kilograms. A typical solid has a density of a few grams 
per cubic centimeter. Such modest numbers are easy to 
remember and handle. In contrast, a density of 3000 kilo- 
grams per cubic meter, although mathematically equivalent, is mentally un- 
wieldy. On each use, you have to think, "How many powers of ten were 
there again?" Therefore, the table gives densities in the mentally friendly 
units of grams per cubic centimeter. 

Problem 6.14 Accounting for discrepancies in the density prediction 

In the table of densities, iron (Fe) shows the biggest discrepancy between the actual 

and predicted densities, by roughly a factor of 2.5. Use that factor to make an 

improved estimate for the interatomic spacing in iron. 

Problem 6.15 Number density of conduction electrons 

Estimate the number density of free (conduction) electrons in a copper wire. 

Problem 6.16 Typical drift speed in a wire 

a. Use your estimate of the number density of conduction electrons in a copper 
wire (Problem 6.15) to estimate the drift speed of electrons in the wire connect- 
ing a typical lamp to the wall socket. 

b. Estimate the time required for electrons to travel at this drift speed from the 
wall socket to the light bulb. Then how does the light bulb turn on right after 
you flip the switch? 



p (gem 3 ) 
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1.0 


1.0 


Si 


28 


1.6 


2.4 


Fe 


56 
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Hg 
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11.2 


13.5 


U 


238 


13.3 


18.7 


Au 


197 


10.9 


19.3 



6.4.2 Graph lumping: The number of undergraduate students 

A particularly important shape is a graph. When applied to graphs, the idea 
of lumping shapes simplifies many a problem, making evident its essential 
features. As an example of graph lumping, we'll estimate the number of US 
undergraduate students. Such back-of-the-envelope market-size estimates 
are valuable for business planning and for making public policy. 
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The first step is to estimate the number of people in the United States who 
are 18, 19, 20, or 21 years old. This total provides, at least in the United 
States, the pool from which most undergraduate students come. Because 
not all 18-to-21 -year-olds go to college, at the end we will multiply the total 
by the fraction of adults who are college graduates. 




Finding the exact pool size requires the birth date 
of every person in the United States. Although 
these data are collected once every decade by the 
US Census Bureau, they would only overwhelm 
us. As an approximation to the voluminous data, 
the Census Bureau also publishes the number of 
people at each age. For example, the 1991 data 
are the wiggly line in the graph. The left side of the graph represents the 
number of infants and toddlers in 1991, and the right side represents the 
number of older people (also in 1991). The undergraduate pool size, rep- 
resenting all 18-, 19-, 20-, and 21-year-olds, is the shaded area. (The peak 
around the ages 30-35 represents the baby boomers, born in the period after 
World War Two.) 



Unfortunately, even this graph depends on the 
huge resources of the Census Bureau, so it is not 
suited for back-of-the-envelope estimates. It also 
provides little insight or transfer value. Insight 
comes from lumping: from turning the complex, 
wiggly curve into a rectangle. The rectangle's di- 
mensions can be determined without any infor- 
mation from the Census Bureau. 




What are the height and width of this rectangle? 



The rectangle's width is a time, so it must be a 
characteristic time related to the population. A 
good guess is the life expectancy, because the age 
distribution varies significantly over that time. In 
the United States, the life expectancy is roughly 
75 years, which will be the rectangle's width. In 
this lumping approximation, everyone lives hap- 
pily until a sudden death at his or her 75th birthday. This all-or-nothing 
reasoning is the essential characteristic of lumping, making it such a useful 
approximation. 
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The rectangle's height can be computed from its area, which is the US popu- 
lation — approximately 300 million (as we estimated in Section 6.3.1). There- 
fore, the height is 4 million per year: 

area 300 million 4 million 

— Trrr yi — • (6-33) 

width 75 years year 




Based on this estimate, there should be 16 million people (in the United 
States) with the four undergraduate ages of 18, 19, 20, or 21 years. 

Not everyone in this pool is an undergraduate. Therefore, the final step is to 
account for the fraction of adults who are college graduates. In the United 
States, where education has traditionally been widely spread throughout 
the population, this fraction (or adjustment factor) is high — say, 0.5. The 
number of US undergraduates should be about 8 million. 

For comparison, the 2010 US census data gives 5.361 million enrolled in 
four-year colleges, and 4.942 million in two-year colleges, for a combined 
total of almost exactly 10 million. Our estimate, which lies halfway between 
the four- year and the combined total, is quite good! 

Even had it been terrible, with a large discrep- « 
ancy between the estimate and the actual num- 'u 
ber, we would still have learned useful informa- T 
tion about a society. As an example, let's use the c 
same method to estimate how many university 
students graduated in 1950 in the United King- 65 years 

dom. ("College" in American English and "uni- 
versity" in British English are roughly equivalent.) The UK population at 
the time was 50 million. With a life expectancy of, say, 65 years, the rectan- 
gle's height is roughly 800 000 per year. If, as in the United States of today, 

50 percent of the age-eligible population goes to university, 400 000 should 
graduate each year. However, in 1950, the actual number was roughly 
17 000 — more than a factor of 20 smaller than the estimate. 



800000/year 



area = 50 million 
(UK in 1950) 



Such a huge error probably does not come from the population estimate. 
Instead, the fraction of 50 percent of university participation must be far 
too high. Indeed, rather than 50 percent, the actual figure for 1950 is 3.4 
percent. The difference between 50- and 3-percent university participation 
makes the United Kingdom of 1950 a very different society from the United 
States of 2010 or even from the United Kingdom of 2010, where the fraction 
going to university is roughly 40 percent. 
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6.4.3 Cone free-fall time and distance 

For our next illustration of graph lumping, let's turn from the social to the 
physical world, to our falling cones. For the single cone of Section 3.5.2 and 
the cone race of Section 4.3.1, we assumed that the cones made their entire 
journey at their terminal speed. But this assumption cannot be exact: Just 
after the cones are released, their fall speed is zero. Graph lumping will 
help us evaluate and refine this assumption. 

Hozvfnr does the cone foil before it reaches its terminal speed? 

Taken literally, the answer is infinity, because no object reaches its termi- 
nal speed. As it approaches its terminal speed, the drag and weight more 
nearly balance, the force and acceleration get closer to zero, and the speed 
changes ever more slowly. So an object can approach its terminal speed 
only ever more closely. Let's therefore rephrase the question to ask how 
far the cone falls before it has reached a significant fraction of its terminal 
speed. In the "significant," you can see the door through which we will 
bring in lumping. 

Here is a sketch of the actual fall speed versus time. 

At first, the speed increases rapidly. As the speed in- 
creases, so does the drag. The net force and the accel- 
eration fall, so the speed increases more slowly. As 
a lumping approximation, we'll replace the smooth 
curve by a slanted and a horizontal segment. 

A partly triangular lumping approximation might 
look different from the population lumping rectan- 
gles we constructed in Section 6.4.2. However, it is 
merely the integral of a rectangle: It is equivalent to 
replacing the actual, complicated acceleration by a 
rectangle representing the period of free fall. Then 
the acceleration drops abruptly to zero. 

Just as we labeled the population rectangle with its 
height and width, here we label the velocity graph 
with speeds, times, and slopes. The graph has two 
segments. The second, horizontal segment shows 
the cone's falling at its terminal speed u term . This 
speed, as we found experimentally in Section 3.5.2, 
is roughly 1 meter per second. 



lumped 




/* ^term ^ 1 n\ 



0.1 s 
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The first, slanted segment represents free fall, as if there were no air resis- 
tance. The free-fall acceleration is g, so the free-fall velocity has slope g: In 
1 second, the speed increases by 10 meters per second. The slanted and hor- 
izontal segments meet where gt = u term . Thus, they meet at approximately 
0.1 seconds. Based on the lumping approximation, we expect the cone to 
reach a significant fraction of its terminal speed by 0.1 seconds — which is 
only 5 percent of the total fall time of 2 seconds. 

Roughly how far does the cone fall in this time? 



Use lumping again! The distance is the area of the 
shaded triangle. The triangle's base is 0.1 seconds, 
and its height is the terminal speed, approximately 
1 meter per second. So its area is 5 centimeters: 



1 

2 



x 0.1 s x 1 ms 1 



5 cm. 



(6.34) 




After only 2.5 percent of its 2-meter journey, the cone has reached a sig- 
nificant fraction of the terminal speed. The "always at terminal speed" ap- 
proximation is quite accurate. Thanks to the lumping, we could judge the 
approximation without setting up or solving differential equations. 

Problem 6.17 Raindrop terminal speed 

Sketch the fall speed of a large raindrop versus time. Roughly how long and how 
far does it fall before it reaches (a significant fraction of) its terminal speed? 

Problem 6.18 Actual cone fall speed 

Set up and solve the differential equations for the fall of a cone with drag propor- 
tional to v 2 and terminal speed of 1 meter per second. What is its speed as a fraction 
of I’term after the cone (a) has fallen for 0.1 seconds, or (b) has fallen 5 centimeters? 



6.4.4 How viscosity burns up energy 

Lumping is particularly useful for gaining insight into fluid flow, a subject 
where the governing equations, the Navier-Stokes equations, are so com- 
plex that almost no problem has an exact solution. These equations were 
introduced in Section 3.5 to encourage you to use conservation reasoning. 
Here their specter is invoked to encourage you to use lumping. 

In everyday life, an important feature of fluid flow is drag. As we discussed 
in Section 5.3.2, drag (in steady-state flow) results from viscosity: Without 
viscosity, there can be no drag. Somehow, viscosity dissipates energy. 
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Using graph lumping, we can understand the mechanism of energy dissi- 
pation without having to understand the detailed physics of viscosity The 
essential physical idea is that the viscous force, a force from a neighboring 
region of fluid, slows down fast pieces of fluid and speeds up slow pieces. 

As an example, the arrows in the velocity-profile diagram 
show a fluid's horizontal speed above a flat boundary — for 
example, the air speed above a frozen lake. Farther above 
the boundary, the arrows are longer, indicating that the fluid 
moves faster. In a high-viscosity liquid, such as honey, the viscous forces 
are large, and they force nearby regions move at nearly the same speed: The 
flow oozes. 

Here is the lumped version of the velocity profile. The vary- 
ing velocity has been replaced by two rectangles correspond- 
ing to two chunks of fluid. The top chunk is moving faster 
than the bottom chunk; because of the velocity difference, 
each chunk exerts a viscous force on the other. 

Let's see what consequence this pair of forces has for the total energy of 
the chunks. To avoid cluttering the essential idea in the analysis with unit 
algebra, let's give the chunks concrete velocities and masses in a simple 
unit system. In this unit system, both chunks will have unit mass. The top 
chunk will move at speed 6 and the bottom chunk at speed 4. 

After a long time, how will viscosity have affected the velocities of the two stabs? 

Because momentum is conserved, the final speeds sum to 10, as they do at 
the start. Because of the viscous force between the chunks, the top chunk 
slows down, and the bottom chunk speeds up. The final speeds, once vis- 
cosity has done its work, literally and figuratively, are 5 and 5. 

The total kinetic energy of the two chunks starts at 26: 

0.5 x 1 x (6 2 + 4 2 ) = 26. (6.35) 

However, their final kinetic energy is only 25: 

0.5 x 1 x (5 2 + 5 2 ) = 25. (6.36) 

Simply because the velocities equalized, the kinetic energy fell! This reduc- 
tion happens no matter what the initial velocities are (Problem 6.19). The 
energy difference turns into heat. Our simple physical model, based on 
lumping, is that viscosity burns up energy by equalizing velocities. 
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Problem 6.19 Generalizing the argument to other initial velocities 

Let the initial velocities of the two chunks be and v 2 , where v-f ± v 2 - Show that 
the initial kinetic energy is greater than the final kinetic energy. 

Problem 6.20 Shortest-time path 

The classic problem in the calculus of variations is to find the 
shortest-time path for a mass sliding without friction between 
two points. This path is called the brachistochrone. A surpris- 
ing conclusion of the full analysis is that this path need not be straight. You can 
make this result plausible by using shape lumping: Rather than considering all 
possible paths, consider only paths with one corner. Under what conditions is 
such a path faster than the straight path (zero corners)? 




6.4.5 Mean free path 

Lumping smooths out variation. In the famous words of Isaiah 40:4, the 
crooked shall be made straight, and the rough places plain. We've used this idea 
in Section 6.4.2 to simplify a function of time (population versus age) and in 
Section 6.4.4 to simplify a function of space (a velocity profile). Now we'll 
combine both kinds of lumping to understand and estimate a mean free 
path. The mean free path is the average distance that a gas molecule travels 
before colliding with another molecule. As we will find in Section 7.3.1 us- 
ing probabilistic reasoning, mean free paths determine important material 
properties, including viscosity and thermal conductivity. 

Let's first estimate the mean free path in the simplest model: a spherical 
molecule moving in a gas of point molecules. The sphere will have radius r 
and the gas molecules a number density n . After analyzing this simplified 
situation, we'll replace the point molecules with more realistic spherical 
molecules. 

The moving molecule has a cross-sectional area a = nr 1 , 
and it sweeps out a tube with the same cross-sectional area 
(analogous to the tube in our analysis of drag via conserva- 
tion of energy in Section 3.5.1). 

How far does the spherical molecule travel down the tube before 
molecule? 

This distance is the mean free path A . However, finding it is complicated 
because the point molecules, always in motion, keep exiting and entering 
the tube. Let's simplify. First, lump in time: Freeze the motion, and pretend 




mean free path A 
it hits a point 
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that the point molecules hold those positions. Then lump in space: Rather 
than spreading the molecules throughout the tube, place them at the far 
end, at a distance A from the near end. 

The tube, whose length is the mean free path A, should be just long enough 
so that the tube contained, before lumping, one point molecule. Then, in 
the lumped model with that one molecule sitting at the end of the tube, the 
spherical molecule will hit one molecule after traveling a distance A . 

The number of molecules in the tube is tier A : 



number - number density x volume = ncrA. 

n u\ 

Therefore, A is determined by the requirement that 
ncrA ~ 1. 

For motion through the gas of point molecules, cr is nr 2 , so 

A (6.39) 

nnr z 

To make the model more realistic, let's replace the point 
molecules with spherical molecules. For simplicity, and 
to model the most frequent case, these molecules will also 
have radius r. 

How does this change affect the mean free path? 



(6.37) 



(6.38) 



A = n(2r) 2 



Now a collision happens if the centers of two molecules approach within a 
distance d = 2 r, the diameter of the sphere. Thus, a — called the scattering 
cross section — becomes n(2r) 2 or nd 2 . The mean free path is then a factor 
of 4 smaller and is 



A 



1 

nnd 2 



(6.40) 



Let's evaluate A for an air molecule traveling in air. For the diameter d, 
use the typical atomic diameter of 3 angstroms or 3 x 10~ 10 meters, which 
also works for small molecules (like air and water). To estimate the number 
density n, use the ideal gas law in the form that, at standard temperature 
and pressure, 1 mole occupies 22 liters. Therefore, 

1 _ 22 l 

n 6 x 10 23 molecules' 



and the mean free path is 



(6.41) 
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~ 2.2 x 10~ 2 m 3 1 

6xl0 23 n x (3 x 10 -10 m) 2 

To evaluate this expression in your head, divide the calculation into three 

steps — doing the most important first. 

1. Units. The numerator contains cubic meters; the denominator contains 
square meters. Their quotient is meters to the first power — as it should 
be for a mean free path. 

2. Powers of ten. The numerator contains —2 powers of ten. The denom- 
inator contains 3 powers of ten: 23 in Avogadro's number and —20 in 
(10 -10 ) 2 . The quotient is —5 powers of ten. Along with the units, the 
expression so far is 10 -5 meters. 



3. Everything else. The remaining factors are 

2.2 

6 x n x 3x3' 



(6.43) 



The 2.2/6 is roughly 10 0 5 . The three factors of 3 (one contributed by 
tc) are 10 15 . Therefore, the remaining factors contribute 10 -2 . 

Putting the pieces together, the mean free path becomes 10 -7 meters, which 
is 100 nanometers — quite close to the true value of 68 nanometers. 



Problem 6.21 Electric field due to a uniform sheet of charge 

In this problem, you investigate a physical model to estimate 
the electric field above a uniform sheet of charge with areal 
charge density a. 

a. Explain why the electric field must be vertical. 

b. At a height z above the sheet, what region of the sheet con- 
tributes significantly to this field? 

c. By lumping the sheet into a point charge with the same charge as the significant 
region, estimate the electric field at a height z, and give the scaling exponent n 
in E oc z”. 

Confirm your result by comparing it to the prediction from dimensional analysis 
(Problem 5.34). 

Problem 6.22 Electric field inside a spherical shell 

Another puzzling electrostatic phenomenon is that the electric field from a uni- 
form shell of charge is zero everywhere inside the shell. At the center, the field must 
be zero by symmetry. However, even away from the center, the field is still zero. 
Explain this phenomenon using a physical model and lumping. 
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6.4.6 Lumping the path of light bent by gravity 



Lumping, as we have seen, replaces a complex, changing process with a 
simpler, constant process. In the next example, we'll use that simplification 
to build and analyze a physical model for the bending of starlight by the 
Sun. In Section 5.3.1, using dimensional analysis and educated guessing, 
we concluded that the bending angle is roughly Gm/rc 2 , where m is the 
mass of the Sun and r is the distance of closest approach (for a ray grazing 
the Sun, it is just the radius of the Sun). Lumping provides a physical model 
for this result; this model will, in Section 8. 2. 2. 2, allow us to predict the 
effect of extremely strong gravitational fields. 



Imagine again a beam (or photon) of light that leaves a distant star. In its 
journey, it grazes the surface of the Sun and reaches our eye. To estimate 
the deflection angle using lumping, first identify the changing process — the 
source of the complexity. Here, the light beam deflects from its original, 
straight path and the gravitational force from the Sun changes in magnitude 
and direction as the photon travels. Therefore, calculating the deflection an- 
gle requires setting up and evaluating an integral — and carefully checking 
its trigonometric factors, such as the cosines and secants. 

The antidote to complicated integrals is lumping. The pill IT 

lumping approximation simply pretends that the beam f ' ' 

bends only near the Sun. In this approximation, only ' v / 

near the Sun does gravity operate. We further assume 
that, while the photon is near the Sun, its downward acceleration (the accel- 
eration perpendicular to the path) is constant, rather than varying rapidly 
with position. 

The problem then simplifies to estimating the de- 
flection while the beam is near the Sun. As a fur- 
ther lumping approximation, let's define "near" 
to mean "within r on either side of the location 
of closest approach." The justification is dimen- 
sional: The only length in the problem is the dis- 
tance of closest approach, which is r; therefore, "near" and "far" are defined 
relative to the characteristic distance r. 




near 

zone 



The geometry is simplest at the point of closest approach. Therefore, let's 
make the further lumping approximation that, although the light beam 
tracks how much downward deflection should happen, the total deflection 
happens only at the point of closest approach. 
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kink c 

velocity before deflection 



This lumped path, instead of changing direction smoothly, has a kink (a 
corner) at the point of closest approach. 

The deflection angle, in the small-angle approximation, is the ratio of veloc- 
ity components: 
v, 

6 « — , ( 6 . 44 ) 

C 

where c is the speed of light, which is the forward velocity, and is the 
accumulated downward velocity. This downward velocity comes from the 
downward acceleration due to the Sun's gravity. 

In the exact analysis, the downward acceleration 
varies along the path. As a function of time, it 
looks like a bell curve centered at the point of 
closest approach (labeled as t = 0, for symmetry). 

The downward velocity is the integral of (the area 
under) the entire a^(t) curve. 





Our lumping analysis replaces the entire area by 
a rectangle centered around the peak. This rec- 
tangle has height Gm/r 2 : the characteristic (and 
peak) downward acceleration. It has width com- 
parable to r/c: the time that the beam spends near 
the Sun. Therefore, its area, which is the lumping 
approximation to v^, is roughly Gm/rc: 



lumped « j 




Pj ~ characteristic downward acceleration x deflection time. ( 6 . 45 ) 
Gm/rc Gm/r 2 ~r/c 



The deflection angle is therefore comparable to Gm/rc 2 : 

v i 

a Gm/rc Gm 

o — — r-. 

c rc L 



( 6 . 46 ) 



The lumping argument explains, with a physical model, the deflection an- 
gle that we predicted using dimensional analysis and educated guessing 
(Section 5.3.1). Lumping once again complements dimensional analysis. 
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Problem 6.23 Sketching the actual and lumped deflection angle 

On axes for cumulative deflection 8 versus distance along the beam, sketch (a) the 
actual curve, (b) the lumped curve, assuming that the deflection happens only 
while the beam is near the Sun, and (c) the lumped curve, assuming, as in the 
text, that the deflection happens only at the point of closest approach. 



6.4./ All-or-nothing reasoning: Solid mechanics by lumping 



For estimating the bending of light, the heart of the lumping analysis was 
all-or-nothing reasoning: replacing the complex, varying downward accel- 
eration with a simpler curve that was either zero or a nonzero constant. To 
practice this idea, we'll apply it to an example from solid mechanics, also a 
subject fraught with differential equations. In particular, we'll estimate the 
contact radius of a solid ball resting on the ground. 



We know a bit about the contact radius: In Problem 5.50, you 
used dimensional analysis to find that the contact radius r is 
given by 





where R is the ball's radius, p is its density, Y is its Young's 
modulus, and / is a dimensionless function. The function f 
is not determined by dimensional analysis, which is purely mathematical 
reasoning. Finding f requires a physical model; the easiest way to make 
and analyze such a model is by making lumping approximations. 



Physically, the ground compresses the tip of the ball by a 
small distance <5, making a flat circle of radius r in contact 
with the ground. The ball fights back, trying to restore its 
natural, spherical shape. When the ball rests on the table, 
the restoring force equals its weight. This constraint will 
give us enough information to find the dimensionless func- 
tion /. 

The restoring force comes from the stress (or pressure) over 
the contact surface. To estimate this stress, let's make the lumping approx- 
imation that it is constant over the contact surface and equal to a typical 
or characteristic stress. This approximation is analogous to replacing the 
varying population curve with a constant value (and making a rectangle). 
With that approximation, the restoring force is 
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force ~ typical stress x contact area. 



( 6 . 48 ) 



We can estimate the stress based on the strain (the fractional compression). 
The stress and strain are related by the Young's modulus Y : 

typical stress ~ Y x typical strain. ( 6 . 49 ) 



The strain, like the stress, varies throughout the ball. In the lumping or 
all-or-nothing approximation, the strain becomes a characteristic or typical 
strain in a region around the contact surface, and zero outside this region. 
The typical strain is a fractional length change: 



strain = 



length change 
length 



( 6 . 50 ) 



The numerator is <5: the amount by which the tip of the ball is compressed. 
The denominator is the size of the mysterious compressed region. 



How large is the compressed region? 



Because the ball's radius or diameter changes, the compressed region might 
be the whole ball. This tempting reasoning turns out to be incorrect. To 
see why, imagine an extreme case: compressing a huge, 10-meter cube of 
rubber (rubber, because one can imagine compressing it). By pressing your 
finger onto the center of one face, you'll indent the face by, say, 1 millimeter 
over an area of 1 square centimeter. In the notation that we use for the ball, 
5 ~ 1 millimeter, r ~ 1 centimeter, and R ~ 10 meters. 

The strained region is not the whole cube, nor any significant fraction of 
it! Rather, its radius is comparable to the radius of the contact region — a 
fingertip (r ~ 1 centimeter). From this thought experiment, we learn that 
when the object is large enough ( R » r), the strained volume is related not 
to the size of the object, but rather to the size of contact region (r). 

Therefore, in the estimate for the typical strain, the length in the denomina- 
tor is r. The typical strain e is then 5/r. 

Because the compression <5, in contrast to the contact radius r and the ball's 
radius R, is not easily visible, let's rewrite 5 in terms of r and R. Amazingly, 
they are related by a geometric mean, because their geometry reproduces 
the geometry of the horizon distance (Section 2.3). The compression <5 is 
analogous to one's height above sea level. The contact radius r is analogous 
to the horizon distance. And the ball's radius R is analogous to the Earth's 
radius. 
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Just as the horizon distance is roughly the geometric mean of the two sur- 
rounding lengths, so should the contact radius be roughly the geometric 
mean of the compression length and the ball's radius. On a logarithmic 
scale, r is roughly halfway between <5 and R. (Like the horizon distance, r is 
actually halfway between 6 and the diameter 2 R. However, the factor of 2 
does not matter to this lumping analysis.) 



tip compression 



contact radius 
I 



ball's radius 



<5 



r 



R 



Therefore, S/r, which is the typical strain, is also roughly r/R. 
The restoring force is therefore comparable to Yr 3 /R: 



typical stress 



restoring force ~ Y x typical strain x contact area. 



Yr 3 /R r/R r 2 

This force balances the weight, which is comparable to pgR 3 : 
Yr 3 

IT 



pgR 3 . 



Therefore, in dimensionless form, the contact radius is given by 



r 

R 



T) 



1/3 



and the dimensionless function / in 




is a cube root (multiplied by a dimensionless constant). 
How large is the contact radius r in practice? 



( 6 . 51 ) 



( 6 . 52 ) 



( 6 . 53 ) 



( 6 . 54 ) 



Let's plug in numbers for a superball (a small, highly elastic rubber ball) 
resting on the ground. The density of rubber is roughly the density of water. 
A superball is small — say, R ~ 1 centimeter. Its elastic modulus is roughly 
Y ~ 3 x 10 7 pascals. (This elastic modulus is a factor of 300 smaller than 
oak's and almost a factor of 10 4 smaller than steel's.) Then 
P g R 



r 

R 



10 3 kg/m J x 10ms 2 x 10 2 m 

3 x 10 7 Pa 

y 



1/3 



( 6 . 55 ) 




228 



6 Lumping 



The quotient inside the parentheses is 10 . Its cube root is roughly 10 2 , 

so r/R is roughly 10 -2 , and r is roughly 0.1 millimeters (a sheet of paper). 

From this example, we see how lumping builds on dimensional analysis. 
Dimensional analysis gave the form of the relation between r/R and pgR/Y. 
Lumping provided a physical model that specified the particular form. 

Problem 6.24 Compression of the tip 

For the superball, estimate 8, the compression of its tip when it is resting on the 
ground. 

Problem 6.25 Contact radius of a marble 

Estimate the contact radius of a small glass marble sitting on a (very) hard table. 

Problem 6.26 Energy argument 

The potential-energy density due to deforming a solid is comparable to Ye 2 . Using 
this relation, find a second explanation for why r/R is comparable to ( pgR/Y ) 11 3 . 

Problem 6.27 Mountain heights 

On Earth, the tallest mountain (Everest) is 9 kilometers high. On Mars, the tallest 
mountain (Olympus) is 27 kilometers high. Make a lumped model of a mountain 
to explain the factor-of-3 difference. If the same reasoning applied to the Moon, 
how high would its tallest mountain be? Why is it so much shorter? 

Problem 6.28 Asteroid shapes 

The tallest mountain on Earth is much smaller than the Earth. Using the moun- 
tain-height data of Problem 6.27, estimate the maximum radius of a planetary body 
made of rock (like the Earth or Mars) that has mountains comparable in size to the 
body. What consequence does this size have for the shapes of asteroids? 

Problem 6.29 Contact time 

Imagine a ball dropped from a height, hitting a hard table with impact speed v. 
Find the scaling exponent /3 in 




where t is the contact time and c s is the speed of sound in the ball. Then write 
t in the form t ~ R/n effective , where i’ e ff e ctive is a weighted geometric mean of the 
impact speed z> and the sound speed c s . 

Problem 6.30 Contact force 

For a small steel ball bouncing from a steel table with impact speed 1 meter per 
second (Problem 6.29), how large is the contact force compared to the ball's weight? 
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6.5 Quantum mechanics 

When we introduced quantum mechanics to estimate the size of hydrogen 
(Section 5.5.1), it was an actor in dimensional analysis. There, quantum 
mechanics merely contributed a new constant of nature h. This contribution 
was mathematical. By using lumping to introduce quantum mechanics, we 
will gain a physical intuition for the effect of quantum mechanics. 

6.5.1 Particle in a box: Size of neutron stars 

In mechanics, the simplest useful model is motion in a straight line at 
constant acceleration (which includes constant velocity). This model 
underlies numerous analyses — for example, the lumping analysis of a 
pendulum period (Section 6.3.3). In quantum mechanics, the simplest 
useful model is a particle confined to a box. Let's give the particle a 
mass m and the box a width n. 

What is the lowest possible energy of the particle — that is, its ground-state 
energy? 

Because this box can be a lumping model for more complex problems, this 
energy will help us explain the binding energy of hydrogen. A lumping 
analysis starts with Heisenberg's uncertainty principle: 

ApAx ~ h. ( 6 . 57 ) 

Here, A p is the spread (the uncertainty) in the particle's momentum. Ax is 
the spread in its position, and h is the quantum constant. If we know the po- 
sition precisely (that is, if the uncertainty Ax is tiny), then the momentum 
uncertainty A p must be large in order to make the product ApAx compa- 
rable to h. In contrast, if we hardly know the position (Ax is large), then 
the momentum uncertainty A p can be small. This relation is the physical 
contribution of quantum mechanics. 

Let's apply it to the particle in the box. The particle could be anywhere in 
the box, so its position uncertainty Ax is comparable to the box width a : 

Ax ~ a. ( 6 . 58 ) 

Because of the uncertainty principle, the consequence of confining the par- 
ticle to the box implies a momentum uncertainty h/a: 

h_ ~h 
Ax a ' 




A p 



( 6 . 59 ) 
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This momentum corresponds to a kinetic energy E ~ (A p) 2 /m. As a conse- 
quence, the particle acquires an energy, called the confinement energy: 

h 2 

E (6.60) 

ma z 

This simple result enables us to estimate the radius of a neutron star: a 
star without any nuclear fusion (which normally resists gravity) and heavy 
enough that gravity has collapsed the protons and electrons into neutrons. 
The atomic structure is gone, and the star is one giant, neutral nucleus (ex- 
cept in the crust, where gravity is not strong enough to make neutrons). 

Problem 6.31 Dimensional analysis for the size of a neutron star 

Use dimensional analysis to find the radius R of a neutron star of mass M. 

Two physical effects compete: Gravity tries to crush the star; and quantum 
mechanics, armed with the uncertainty principle, resists by penalizing con- 
finement. To begin quantifying these effects, let the star have radius R and 
contain N neutrons, so it has mass M = Nm n . 

Gravity's contribution to crushing the star is reflected in the star's gravita- 
tional potential energy 

^potential ~ • (6.61) 



This energy is negative: Gravity is happier (the energy is lower) when R is 
smaller. Here, the minus sign is incorporated into the single approximation 
sign-. 



For the other side of the competition, we confine each neutron to a box. To 
find the box width a, imagine the star as a three-dimensional cubic lattice of 
neutrons. Because the star contains N neutrons, the lattice is N 1/5 x N 1/3 x 
N 1/3 . Each side has length comparable to R, so aN 1/5 ~ R and a ~ R/N 1/3 . 

Each neutron gets a confinement (kinetic) energy h 2 /m n a 2 . For all the N 
neutrons together, and using a ~ R/N 1/3 , 



E 



kinetic 



N 



h 2 



m„a z 



Af5/3 



h 2 

m n R 2 



(6.62) 



In terms of the known masses M and m n (instead of N ), the total confine- 
ment energy is 

r M 5/3 h 2 

T kinetic ~ m 8/3 p 2 ‘ 
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The kinetic energy is proportional to R~ 2 , whereas the poten- 
tial energy is proportional to R _1 . On log-log axes of energy 
versus radius, the kinetic energy is a straight line with a —2 
slope, and the potential energy is a straight line with a —1 
slope. Because the slopes differ, the lines cross. The radius 
where they cross, which is roughly the radius that minimizes 
the total energy, is also roughly the radius of the neutron 
star. (We used the same reasoning in Section 4.6.1, to find 
the speed that minimized the energy required to fly.) 




With all the constants included, the crossing happens when R satisfies 
GM 2 M 5/3 h 2 



R 



m 8/ 3 R 2 



(6.64) 



The radius of the neutron star is then 



R 



h 2 

GMh3 m 8/3' 



(6.65) 



If the Sun, with a mass of 2xl0 30 kilograms, became a neutron star, it would 
have a radius of roughly 3 kilometers: 

(lO - 34 kgm 2 s -1 ) 2 

7 x 10 -11 kg -1 m 3 s -2 x (2 x 10 30 kg) 1 ^ x (1.6 x 10 -27 kg) 8/3 (6-66) 
~ 3 km. 



The true neutron-star radius for the Sun, after including the complicated 
physics of a varying density and pressure, is approximately 10 kilometers. 
(In reality, the Sun's gravity wouldn't be strong enough to crush the elec- 
trons and protons into neutrons, and it would end up as a white dwarf 
rather than a neutron star. However, Sirius, the subject of Problem 6.32, is 
massive enough.) 



Here is a friendly numerical form that incorporates the 10-kilometer infor- 
mation. It makes explicit the scaling relation between the radius and the 
mass, and it measures mass in the convenient units of the solar mass. 



R w 10 km x 



Msun 

M 



1/3 



(6.67) 



Our analysis, which used several lumping approximations and neglected 
many dimensionless constants throughout, predicted a prefactor of 3 kilo- 
meters instead of 10 kilometers. An error of a factor of 3 is a worthwhile 
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tradeoff. We avoid the complicated analysis of a quantum fluid of electrons, 
protons, and neutrons, and still gain insight into the essential physical idea: 
The size of a neutron star results from a competition between gravity and 
quantum mechanics. (You will find many insightful astrophysical scalings 
in an article on the "Astronomical reach of fundamental physics" [5].) 

Problem 6.32 Sirius as a neutron star 

Sirius, the brightest star in the night sky, has a mass of 4 x 10 30 kilograms. What 
would be its radius as a neutron star? 



6.5.2 Hydrogen by lumping 



Having practiced quantum mechanics and lumping in the large (a neutron 
star), let's go to the small: We'll use lumping to complement the dimen- 
sional-analysis estimate of the size of hydrogen in Section 5.5.1. 



Its size is the radius of the orbit with the lowest energy. Because the energy 
is a sum of the potential energy from the electrostatic attraction and the 
kinetic energy from quantum mechanics, hydrogen is analogous to a neu- 
tron star, where quantum mechanics competes instead against gravitation. 
Partly because electrostatics is much stronger than gravity (Problem 6.34), 
hydrogen is very tiny. 



For now, let the hydrogen atom have an unknown radius r. 
As a lumping approximation, the complicated electrostatic 
potential becomes a box of width r. Then the electron's 
momentum uncertainty is Ap ~ h/r and its confinement 
energy is h 2 /m e r 2 : 



E 



kinetic 



(Ap ) 2 

m e 



h 2 

m e r 2 



( 6 . 68 ) 




This energy competes against the electrostatic energy, whose estimate also 
requires a lumping approximation. For in quantum mechanics, an electron 
is not at any definite location. Depending on your interpretation of quan- 
tum mechanics, and speaking roughly, it is either smeared over the box, 
or it has a probability of being anywhere in the box. On either interpreta- 
tion, calculating the potential energy requires an integral. However, we can 
approximate it by using lumping: The electron's typical, or characteristic, 
distance from the proton is simply r, the radius of hydrogen (which is the 
size of the lumping box). 
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Then the potential energy is the electrostatic energy of a proton and electron 
separated by r: 



^potential- ■ 



(6.69) 



The total energy is their sum 



E ^potential T E 



kinetic 



e 2 ft 2 
47re 0 r m e r 



(6.70) 



The electron adjusts r to minimize this energy. As we 
learned in the analysis of lift (Section 4.6.1), when two 
terms with different scaling exponents compete, the min- 
imum occurs where the terms are comparable. (In Sec- 
tion 8. 3. 2. 2, we'll return to this example with the addi- 
tional tool of easy cases and also sketch the energies on 
log-log axes.) Using the Bohr radius n 0 as the separation 
of minimum energy, comparability means 

e 2 h 2 

4:ne 0 a 0 m e a 

F F 

potential ^kinetic 




The Bohr radius is therefore 
h 2 

«o TYTa U (6 - 72) 

m e (e 2 /47re 0 ) 

as we found in Section 5.5.1 using dimensional analysis. Now we also get 
a physical model: The size of hydrogen results from competition between 
electrostatics and quantum mechanics. 



Problem 6.33 Minimizing the energy in hydrogen using symmetry 

Use symmetry to find the minimum-energy separation in hydrogen, where the 
total energy has the form 

A B 

-S' - -• (6-73) 

r z r 

(See Section 3.2.3 for related examples.) 

Problem 6.34 Electrostatics versus gravitation 

Estimate the ratio of the gravitational to the electrostatic force in hydrogen. What 
are the similarities and differences between this estimate and the estimate that you 
made in Problem 2.15? 
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6.6 Summary and further problems 

Lumping is our first tool for discarding complexity with loss of information. 
By doing so, it simplifies complicated problems where our previous set of 
tools could not. Curves become straight lines, calculus becomes algebra, 
and even quantum mechanics becomes comprehensible. 

Problem 6.35 Precession of the equinoxes 

Because the Earth is oblate (Problem 5.46), and thus 
has a bit of fat near the equator, and because its rota- 
tion axis is tilted relative to the Earth's orbital plane, 
the Sun and the Moon each exert a slight torque on 
the Earth. This torque slowly rotates (precesses) the 
Earth's axis of rotation (so the north star will not al- 
ways be the north star). 

a. Explain why F j and F 2 , the gravitational forces 
on the two bulges, are almost exactly equal in di- 
rection but not equal in magnitude. 

b. Use lumping to find the scaling exponents x and y in 

torque oc m x l y , (6.74) 

where m is the mass of the object (either the Sun or Moon) and 1 is its distance 
from the Earth. Then estimate the ratio 
torque from the Sun 

5 ; — . (6.75) 

torque trom the Moon 

c. By including the constants of proportionality, estimate the total torque and the 
precession rate. 

Problem 6.36 Graph lumping in reverse 

In a constant-temperature (isothermal) atmosphere, which isn't a terrible approxi- 
mation to the actual atmosphere, the air density falls exponentially with height: 

p = p 0 e~ z/H , (6.76) 

where z is the height above sea level, p 0 is the density at sea level, and H is the 
scale height of the atmosphere. We estimated H in Section 5.4.1 using dimensional 
analysis; in this problem, you'll estimate it by using lumping in reverse. 

a. Sketch p(z) versus z as given above. On the same graph, sketch a lumped p(z) 
that represents a sea-level-density atmosphere for z < H and zero density for 
z > H. Ensure that your lumping rectangle has the same area as the exponen- 
tially decaying p(z) curve. 

b. Use your lumping rectangle and the sea-level pressure p 0 to estimate H. Then 
estimate the relative density at the top of Mount Everest (roughly 9 kilometers). 
Check your estimate by looking up the actual air density on Mount Everest. 
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Our previous tool, lumping, helps us simplify by discarding less important 
information. Our next tool, probabilistic reasoning, helps us when our in- 
formation is already incomplete — when we've discarded even the chance 
or the wish to collect the missing information. 



7.1 Probability as degree of belief: Bayesian probability 

The essential concept in using probability to simplify the world is that prob- 
ability is a degree of belief. Therefore, a probability is based on our knowl- 
edge, and it changes when our knowledge changes. 

7.1.1 Is it my telephone number? 

Here is an example from soon after I had moved to England. I was talking 
to a friend on the phone, of the old-fashioned variety with wires connecting 
it to the wall. David needed to call me back. However, having just moved 
to the apartment, I was unsure of my phone number; plus, for anyone used 
to American phone numbers, British phone numbers have a strange and 
hard-to-remember format. I had a reasonably likely guess, which I gave 
David so that he could call me back. After I hung up, I tested my guess by 
picking up my phone and dialing my guess — and got a busy signal. 





236 



7 Probabilistic reasoning 



Given this experimental evidence, how sure am I that the candidate number is my 
phone number? Quantitatively, what odds should I give? 

This question makes no sense if probability is seen as long-run frequency. 
In that view, the probability of a coin turning up heads is 1 /2 because 1 /2 
is the limiting proportion of heads in an ever-longer series of tosses. How- 
ever, for evaluating the plausibility of the phone number, this interpreta- 
tion — called the frequentist interpretation — cannot apply, because there is 
no repeated experiment. 

The frequentist interpretation gets stuck because it places probability in 
the physical system itself. The alternative — that probability reflects the in- 
completeness of our knowledge — is known as the Bayesian interpretation 
of probability. It is the interpretation suited for mastering complexity. A 
book-length discussion and application of this fundamental point is Edwin 
Jaynes's Probability Theory: The Logic of Science [26]. 

The Bayesian interpretation is based on one simple idea: A probability re- 
flects our degree of belief in a hypothesis. Probabilities are therefore sub- 
jective: Someone with different knowledge will have different probabili- 
ties. Thus, by collecting evidence, our degrees of belief change. Evidence 
changes probabilities. 

In the phone-number problem, what is the hypothesis and what is the evidence? 

The hypothesis — often denoted H — is the statement about the world whose 
credibility we would like to judge. Here, 

H = My phone-number guess is correct. (7.1) 

The evidence — often denoted E or D (for data) — is the information that we 
collect, obtain, or learn and then use to judge the hypothesis. It augments 
our knowledge. Here, E is the result of the experiment: 

E = Dialing my guess gave a busy signal. (7.2) 

Any hypothesis has an initial probability Pr (H ) . This probability is called 
the prior probability, because it is the probability prior to, or before, incor- 
porating the evidence. After learning the evidence E, the hypothesis has a 
new probability Pr (H | E): the probability of the hypothesis H given — that 
is, upon assuming — the evidence E. This probability is called the posterior 
probability, because it is the probability, or degree of belief, after including 
the evidence. 
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The recipe for using evidence to update probabilities is Bayes' theorem: 

Pr (H | E) oc Pr (H) x Pr (E \ H) . (7.3) 

The new factor, the probability Pr (£ | H ) — the probability of the evidence 
given the hypothesis — is called the likelihood. It measures how well the 
candidate theory (the hypothesis) explains the evidence. Bayes' theorem 
then says that 

posterior probability oc prior probability x explanatory power. (7.4) 
Pr(H|E) Pr (H) Pr(E | H) 



(The constant of proportionality is chosen so that the posterior probabilities 
for all the competing hypotheses add to 1.) Both probabilities on the right 
are necessary. Without the likelihood, we could not change our probabili- 
ties. Without the prior probability, we would always prefer the hypothesis 
with the maximum likelihood, no matter how contrived or post hoc. 



In a frequent use of Bayes' theorem, there are only two hypotheses, H and 
its negation H. In this problem, H is the statement that my guess is wrong. 
With only two hypotheses, a compact form of Bayes' theorem uses odds 
instead of probabilities, thereby avoiding the constant of proportionality: 



posterior odds = prior odds x 

0(H | E) 0(H) 



Pr(E|H) 
Pr (E |H) " 



(7.5) 



The odds O are related to the probability p by O = p/(l — p). For example, 
a probability of p = 2/3 corresponds to an odds of 2 — often written as 2:1 
and read as "2-to-l odds." 



Problem 7.1 Converting probabilities to odds 

Convert the following probabilities to odds: (a) 0.01, (b) 0.9, (c) 0.75, and (d) 0.3. 

Problem 7.2 Converting odds to probabilities 

Convert the following odds to probabilities: (a) 3, (b) 1/3, (c) 1:9, and (d) 4-to-l. 

The ratio Pr (E | H) /Pr (E | H) is called the likelihood ratio. Its numerator 
measures how well the hypothesis H explains the evidence E; its denomi- 
nator measures how well the contrary hypothesis H explains the same ev- 
idence. So their ratio measures the relative explanatory power of the two 
hypotheses. Bayes' theorem, in the odds form, is simple: 

updated odds = initial odds x relative explanatory power. (7.6) 
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Let's use Bayes' theorem to judge my phone-number guess. Before the ex- 
periment, I was not too sure of the phone number; Pr (Ef) is perhaps 1/2, 
making O(H) = 1. In the likelihood ratio, the numerator Pr (E | Ef) is the 
probability of getting a busy signal assuming ("given") that my guess is cor- 
rect. Because I would be dialing my own phone using my phone, I would 
definitely get a busy signal. Thus, Pr (E | El) = 1: The hypothesis of a cor- 
rect guess ( H ) explains the data as well as possible. 

The trickier estimate is the denominator Pr (E |H): the probability of get- 
ting a busy signal assuming that my guess is incorrect. I'll assume that my 
guess is still a valid phone number (I nowadays rarely get the recorded mes- 
sage saying that I have dialed an invalid number). Then I would be dialing 
a random person's phone. Thus, Pr (E | H) is the probability that a random 
valid phone is busy. It is probably similar to the fraction of the day that my 
own phone is busy. In my household, the phone is in use for 0.5 hours in a 
24-hour day, and the busy fraction could be 0.5/24. 



However, that estimate uses an overly long time, 24 hours, for the denomi- 
nator. If I do the experiment at 3 am and my guess is wrong, I would wake 
up an innocent bystander. Furthermore, I am not often on the phone at 
3 am. A more reasonable denominator is 10 hours (9 am to 7 pm), making 
the busy fraction and the likelihood Pr (E | H) roughly 0.05. An incorrect 
guess (H) is a lousy explanation for the data. 



The relative explanatory power of H and H, which is measured by the like- 
lihood ratio, is roughly 20: 



Pr(E|H) ^ j_ 
Pr (E | H) ~ 0.05 



(7.7) 



Because the prior odds were 1 to 1, the updated, posterior odds are 20 to 1: 
posterior odds = prior odds x likelihood ratio ~ 20. (7.8) 

O(H|E)~20 0(H)~1 Pr(E | H) / Pr(E | H) ~ 20 



My guess has become very likely — and it turned out to be correct. 
Problem 7.3 PKU testing 

In most American states and many countries, newborn babies are tested for the 
metabolic defect phenylketonuria (PKU). The prior odds of having PKU are about 
1 in 10 000. The test gives a false-positive result 0.23 percent of the time; it gives a 
false-negative result 0.3 percent of the time. What are Pr (PKU | positive test) and 
Pr (PKU | negative test)? 
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7.2 Plausible ranges: Why divide and conquer works 

The Bayesian understanding of probability as degree of belief will show 
us why divide-and-conquer reasoning (Chapter 1) works. We'll see how it 
increases our confidence in an estimate and decreases our uncertainty as 
we analyze a divide-and-conquer estimate in slow motion. 

7.2.1 Land area of the United Kingdom 

The estimate will be the land area of the United Kingdom, which is where 
I was born and later spent many years. So I have some implicit knowledge 
of the area, but I don't know it explicitly. The estimate therefore serves as 
a model for the common situation where we know more than we think we 
do and need to bring out and use that knowledge. 

To make the initial estimate, the baseline against which to compare the di- 
vide-and-conquer estimate, I talk to my gut using the rubric of Section 1.6. 
For the lower end of my range, 10 4 square kilometers feels right: I would 
be fairly surprised if the area were smaller. For the upper end, 10 7 square 
kilometers feels right: I would be fairly surprised if the area were larger. 
Combining the endpoints, I would be mildly surprised if the area were 
smaller than 10 4 square kilometers or greater than 10 7 square kilometers. 
The wide range, spanning three orders of magnitude, reflects the difficulty 
of estimating an area without using divide-and-conquer reasoning. 

My confidence in the gut estimate is my probability for the hypothesis H : 
H = The UK's land area lies in the range 10 4 ...10 7 km 2 . (7.9) 

This probability assumes, or is based on, my background knowledge K: 

K = what I know about the area before using divide and conquer. (7.10) 

My confidence or degree of belief in the guess is the conditional probability 
Pr (H | K) : the probability that the area lies in the range, based on my knowl- 
edge before applying divide-and-conquer reasoning. Alas, no algorithm is 
known for computing a probability based on such complicated background 
information. The best that we can do is to introspect: to hold a further gut 
discussion. This discussion concerns not the area itself, but rather the de- 
gree of belief about the range 10 4 ...10 7 square kilometers. 

My gut chose the range for which I would feel mild surprise, but not shock, 
to learn that the area lies outside it. The surprise implies that the probability 
Pr (H | K) is larger than 1 /2: If Pr (hi | K) were less than 1 /2, I would be 
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surprised to find the area inside the range. The mildness of the surprise 
suggests that Pr (H | K) is not much larger than 1/2. The probability feels 
like 2/3. The corresponding odds are 2:1; I'd give 2-to-l odds that the area 
lies within the plausible range. With a further assumption of symmetry 
so that the area's falling below and above the range are equally likely, the 
plausible range represents the following probabilities: 




where the wavy lines at the ends indicate that the left and right ranges ex- 
tend down to zero and up to infinity respectively. 

Now let's see how a divide-and-conquer estimate changes 
the plausible range. To make this estimate, lump the area 
into a rectangle with the same area and aspect ratio as the 
United Kingdom. My own best-guess rectangle is super- 
imposed on the map outline of the United Kingdom. Its 
area is the product of its width and height. Then the area 
estimate divides into two simpler estimates. 

1. Lumped width. Before I ask my gut for the plausible 
range for the width, I prepare by reviewing my knowl- 
edge of the width. Crossing the United Kingdom in 
the south, say from London to Cornwall, takes maybe 
4 hours by car. But much of the United Kingdom is 
thinner, so the average, or lumped, width corresponds 
to maybe 3 hours of driving. My gut is content with 
the range 150. ..250 miles or 240. ..400 kilometers: 

240 km plausible range 400 km 

\ p~l/6 p » 2/3 p-1/6 ^ 

On a logarithmic scale, which is the correct scale for positive quantities 
such as the width and height, the midpoint of the range is 4 240 x 400 
or 310 kilometers. It is my best estimate of the width. 

2. Lumped height. The train journey from London in the south of England 
to Edinburgh in Scotland, in the northern part of the United Kingdom, 
takes about 5 hours at, say, 80 miles per hour. In addition to these 400 
miles, there's more latitude in Scotland north of Edinburgh and some 
in England south of London. Thus, my gut estimate for the height of 







width 



height 
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the lumping rectangle is 500 miles. This distance is the midpoint of my 
plausible range for the height. 

The estimate feels accurate to plus or minus 20 percent (roughly +100 
miles). On a logarithmic (or multiplicative) scale, the range would be 
roughly a factor of 1.2 on either side of midpoint: 

420 ... 600 miles. (7.11) 

500/1.2 500x1.2 

(A more accurate value for 500/1.2 is 417, but there's little reason to 
strive for such accuracy when these endpoints are rough estimates any- 
way.) In metric units, the range is 670. ..960 kilometers, with a midpoint 
of 800 kilometers. Here is its probability interpretation: 



670 km 
> p ~ 1/6 



plausible range 
2/3 



960 km 



p ~ 1/6 ^ 



The next step is to combine the plausible ranges for the height and 
the width in order to make the plausible range for the area. A first 
approach, because the area is the product of the width and height, 
is simply to multiply the endpoints of the width and height ranges: 

A min « 240 km x 670 km « 160 000 km 2 ; 

j (7.12) 

A max ~ 400 km x 960 km « 380 000 km 2 . 

The geometric mean (the midpoint) of these endpoints is 250 000 
square kilometers. 

Although reasonable, this approach overestimates the width of 
the plausible range — a mistake that we'll correct shortly. How- 
ever, even this overestimated range spans only a factor of 2.4, whereas my 
starting range of 10 4 ...10 7 square kilometers spans a factor of 1000. Di- 
vide-and-conquer reasoning has significantly narrowed my plausible range 
by replacing a quantity about which I have vague knowledge, namely the 
area, with quantities about which I have more precise knowledge. 

The second bonus is that subdividing into many quantities carries only a 
small penalty, smaller than suggested by simply multiplying the endpoints. 
Multiplying the endpoints produces a range whose width is the product of 
the two widths. But this width assumes the worst. To see how, imagine an 
extreme case: estimating a quantity that is the product of ten independent 
factors, each of which you know to within a factor of 2 (in other words, each 
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plausible range spans a factor of 4). Does the plausible range for the final 
quantity span a factor of 4 10 (approximately 10 6 )? That conclusion is terribly 
pessimistic. More likely, several of the ten estimates will be too large and 
several will be too small, allowing many errors to cancel. 

Correctly computing the plausible range for the area requires a complete 
probabilistic description of the plausible ranges for the width and height. 
With it we could compute the probability of each possible product. That 
description is, somewhere, available to the person giving the range. But 
no one knows how to deduce such a complete set of probabilities based on 
the complex, diffuse, and seemingly contradictory information lodged in a 
human mind. 

A simple solution is to specify a reasonable probability distribution. We'll 
use the log-normal distribution. This choice means that, on a logarithmic 
scale, the probability distribution is a normal distribution, also known as a 
Gaussian distribution. 

Here's the log-normal distribution for my gut estimate 
of the height of the UK lumping rectangle. The horizon- 
tal axis is logarithmic: Distances correspond to ratios 
rather than differences. Therefore, 800 rather than 815 
kilometers lies halfway between 670 and 960 kilometers. 

These endpoints are a factor of 1.2 smaller or larger than 
the midpoint. The peak at 800 kilometers reflects my belief that 800 kilome- 
ters is the best guess. The shaded area of 2/3 quantifies my confidence that 
the true height lies in the range 670. ..960 kilometers. 

We use the log-normal distribution for several reasons. First, our mental 
hardware compares quantities using ratio rather than absolute difference. 
(By "quantity," I mean inherently positive values, such as distance, rather 
than signed values such as position.) In short, our hardware places quan- 
tities on a logarithmic scale. To represent our thinking, we therefore place 
the distribution on a logarithmic scale. 

Second, the normal distribution, which has only two parameters (midpoint 
and width), is simple to describe. This simplicity helps us when we trans- 
late our internal gut knowledge into the distribution's parameters. From 
our gut estimates of the lower and upper endpoints, we just find the cor- 
responding midpoint and width (on a logarithmic scale!): The midpoint is 
the geometric mean of the two endpoints, and the width is the square root 
of the ratio between the upper and lower endpoints. 
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Third, normal distributions combine simply. When we add two quantities 
represented by a normal distribution, their sum is also represented by a 
normal distribution. 

On the other hand, the normal distribution does not represent all aspects 
of our gut knowledge. In particular, the tails of the normal distribution 
are too thin, reflecting an unrealistically high confidence that our estimate 
does not contain a huge error. Fortunately, for our analyses, this problem is 
not so significant, because we will concern ourselves with the location and 
width of the central region, and not take the thin tails too seriously. With 
that caveat, we'll represent our plausible range as a normal distribution on 
a logarithmic scale: a log-normal distribution. 

Here is the other log-normal distribution, for the width 
of the UK lumping rectangle. The shaded range is the 
so-called one-sigma range /< — cr to }i + cr, where // is the 
midpoint (here, 310 kilometers) and j is the width, mea- 
sured as a distance on a logarithmic scale. In a normal 
distribution, the one-sigma range contains 68 percent 
of the probability — conveniently close to 2/3. When we ask our plausible 
ranges to contain a 2/3 probability, we are estimating a one-sigma range. 

The two log-normal distributions supply the probabilistic description re- 
quired to combine the plausible ranges. The rules of probability theory 
(Problem 7.5) produce the following two-part recipe. 

1. The midpoint of the plausible range for the area A is the product of the 
midpoint of the plausible ranges for h and w. Here, the height midpoint 
is 800 kilometers and the width midpoint is 310 kilometers, so the area 
midpoint is roughly 250 000 square kilometers: 

800 km x 310 km « 250 000 km 2 . (7.13) 

h w 

2. To compute the plausible range's width or half width, first express the 
individual half widths (the cr values) in logarithmic units. Convenient 
units include factors of 10, also known as bels, or the even- more-conve- 
nient decibels. A decibel, whose abbreviation is dB, is one-tenth of a 
factor of 10. Here is the conversion between a factor / and decibels: 

number of decibels = 10 log 1Q /. (7.14) 

For example, a factor of 3 is close to 5 decibels (because 3 is almost 
one-half of a power of ten), and a factor of 2 is almost exactly 3 decibels. 
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(These decibels are slightly more general than the acoustic decibels intro- 
duced in Problem 3.10: Acoustic decibels measure energy flux relative 
to a reference value, usually 10 -12 watts per square meter. Both kinds 
of decibels measure factors of 10, but the decibels here have no implicit 
reference value.) 

In decibels, bels, or any logarithmic unit, the half width 
(the a) of the product's range is the Pythagorean sum 
of the individual half widths (the a values). Using a x 
to represent the half width of the plausible range for 
the quantity x, the recipe is 

<74 = ^k +t7 ur < 7 ' 15 ) 

Let's apply this recipe to our example. The plausible range for the height 
( h ) was 800 kilometers give or take a factor of 1.2. On a logarithmic scale, 
distances are measured by ratios or factors, so think of a range as "give or 
take a factor of" rather than as "plus or minus" (a description that would 
be appropriate on a linear scale). A factor of 1.2 is about +0.8 decibels: 

101og 10 1.2 « 0.8. (7.16) 

Therefore, a h ~ 0.8 decibels. 

The plausible range for the width ( w ) was roughly 310 kilometers give or 
take a factor of 1.3. A factor of 1.3 is ±1.1 decibels: 

101og 10 1.3 « 1.1. (7.17) 

Therefore, cr w ~ 1.1 decibels. 

The Pythagorean sum of a h and a w is approximately 1.4 decibels: 

V0.8 2 + l.l 2 « 1.4. (7.18) 

As a factor, 1.4 decibels is, coincidentally, approximately a factor of 1.4: 

10 L4/1 ° « 1.4. (7.19) 

Because the midpoint of the plausible range is 250 000 kilometers, the UK 
land area should be 250000 square kilometers give or take a factor of 1.4. 
Retaining a bit more accuracy, it is a factor of 1.37. 

180000 ... 250 000 ... 340 000 km 2 . (7.20) 

/1.37 midpoint xl.37 

As a probability bar, the range is 




r idth of h's range 
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180000 km 2 
p ~ 1/6 



250000 km 2 
2/3 



340000 km 2 

p ~ 1/6 



The true area is 243 610 square kilometers. This area is comfortably in my 
predicted range and surprisingly close to the midpoint. 

How surprising is the accuracy of the estimate? 



The surprise can be quantified with a probability: the probability that the 
true value would be closer to the midpoint (the best estimate) than 243 610 
square kilometers is. Here, 243 610 is 2.6 percent or a factor of 1.026 smaller 
than 250 000. The probability that the true value would be within a factor 
of 1.026 (on either side) of the midpoint is the tiny shaded region of the 
log-normal distribution. 




The region is almost exactly a rectangle, so its area is approximately its 
height multiplied by its width. The height is the peak height of a normal 
distribution. In a = 1 units (in dimensionless form), this height is 1 /fin . 

The width of the shaded region, also in a = 1 units, is the ratio 
^ dB equivalent for a factor of 1.026 
dB equivalent for a factor of 1.37 ' 

which is approximately 0.16. (The factor of 2 arises because the region ex- 
tends equally on both sides of the peak.) Therefore, the shaded probabil- 
ity is approximately 0.16 /fin or only 0.07. I am surprised, but encour- 
aged, by the high accuracy of my estimate for the UK's land area. Once 
again, many individual errors — for example, in estimating journey times 
and speeds — have canceled out. 

Problem 7.4 Volume of a room 

Estimate the volume of your favorite room, comparing your plausible ranges be- 
fore and after using divide-and-conquer reasoning. 

Problem 7.5 Justifying the recipe for combining ranges 

Use Bayes' theorem to justify the recipe for combining plausible ranges. 
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Problem 7.6 Practice combining plausible ranges 

You are trying to estimate the area of a rectangular field. Your plausible ranges for 
its width and length are 1...10 meters 10. ..100 meters, respectively. 

a. What are the midpoints of the two plausible ranges? 

b. What is the midpoint of the plausible range for the area? 

c. What is the too-pessimistic range for the area, obtained by multiplying the cor- 
responding endpoints? 

d. What is the actual plausible range for the area, based on combining log-normal 
distributions? This range should be narrower than the pessimistic range in 
part (c)! 

e. How do the results change if the ranges are instead 2. . .20 meters for the width 
and 20... 200 meters for the length? 

Problem 7.7 Area of A4 paper 

If you have a sheet of standard European (A4) paper handy, either in reality or 
mentally, find your plausible range for its area A by gut estimating its length and 
width (without using a ruler). Then compare your best estimate (the midpoint of 
your range) to the official area of An paper, which is 2~ n square meters. 

Problem 7.8 Estimating a mass 

In trying to estimate the mass of an object, your plausible range for its density is 
1...5 grams per cubic centimeter and for its volume is 10. ..50 cubic centimeters. 
What is (roughly) your plausible range for its mass? 

Problem 7.9 Which is the wider range? 

Suppose that your knowledge of the quantities a, b, and c is given by these plausi- 
ble ranges: 

a = 1...10 

b = 1...10 (7.22) 

c = 1...10. 

Which quantity — abc or a 2 b — has the wider plausible range? 

Problem 7.10 Handling division 

If a quantity a has the plausible range 1...4, and the quantity b has the plausible 
range 10... 40, what are the plausible ranges for ab and for a/b? 



7.2.2 Finding one-sigma endpoints before the midpoint 

With our understanding of probability, we can explain two curious and 
seemingly arbitrary features of how we make gut estimates (Section 1.6). 
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First, we learned not to ask our gut about its best estimate. Rather, we ask 
about its lower and upper endpoints, from which we find the best estimate 
as their midpoint. Second, in finding the endpoints, the standard we use 
is "mildly surprised": We should be mildly surprised if the true value lies 
outside the endpoints. Quantitatively, "mild surprise" now means that the 
probability should 2/3 that the true value lies between the endpoints. 

The first feature, that we estimate the endpoints 
rather than the midpoint directly, is explained by 
the shape of the log-normal distribution. Imag- 
ine trying to locate its midpoint by offering your 
gut candidate midpoints. The distribution is flat 
at the midpoint, so the probability hardly varies 
as the candidates change. Thus, your gut will answer with almost identical 
sensations of ease over a wide range around the midpoint. As a result, you 
cannot easily extract a decent midpoint estimate. 

The solution to this problem explains the second curious feature: why the 
plausible range should enclose a probability of 2/3. The solution is to esti- 
mate the location of the steepest places on the curve. Those spots are the 
points of maximum slope (in absolute value). Around these values, the 
probability changes most rapidly and so does the sensation of ease. 

If we use the most convenient logarithmic units, / \ . 

where the midpoint p is 0 and the half width cr is max - sl °pe/ .max. slope 

1, then the log-normal distribution becomes the / 7 \ 

reasonably simple form P ~ 3 v 

1 2 ' ‘ 

p(x) = e~ x /2 , (7.23) l l ~ a V- i l + (T 

din 



zero slope 




where x measures half widths away from the peak. Its slope p'(x) is a maxi- 
mum where the derivative of p'(x), namely p" (x), is zero. These points are 
also called the inflection or zero-curvature points. (Where the curvature is 
zero, the curve is straight, so the dashed tangent lines pass through it.) 



Ignoring dimensionless prefactors, 

p'(x) xe~ x2/2 ; 

p"(x) ~ (x 2 — l)e _v ~ /2 . 



(7.24) 




The second derivative is zero when x = ±1. This 

value is expressed in the }i = 0, cr — 1 system of units. In the usual system. 
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x = +1 means the one-sigma points }i ± cr. So the points of maximum 
(absolute) slope — the points that our gut can most accurately estimate — are 
the one-sigma points! We find them first and then find their midpoint. 

When we estimate a 2 /3-probability range, we are finding a one-sigma 
range almost exactly: In a normal or log-normal distribution, the one-sigma 
range contains approximately 68 percent of the probability, which is almost 
exactly 2/3. For comparison, the two-sigma range contains approximately 
95 percent of the probability, which is a popular number in statistical analy- 
sis. Therefore, you may also want to find your two-sigma range (Prob- 
lem 7.11). However, the slope at the two-sigma points is approximately 
a factor of 2.2 smaller than the slope at the one-sigma endpoints, so the 
two-sigma range is somewhat harder to estimate than is the one-sigma 
range. To find the two-sigma range, first estimate the one-sigma range, and 
then double its width (on a log scale). 

This analysis of plausible ranges concludes our introduction to probabil- 
ity and the probabilistic basis of divide-and-conquer reasoning. We have 
learned that probabilities result from the incompleteness of our knowledge, 
and how acquiring knowledge changes our probabilities. In the next sec- 
tions, we will use probability to master the complexity of systems with vast 
numbers of atoms and molecules, where complete knowledge would be 
impossible. The analysis begins with a special walk, the random walk. 

Problem 7.11 Two-sigma range 

My one-sigma range for the UK's land area is 180 000... 340 000 square kilometers. 

What is the two-sigma range? 

Problem 7.12 Gold or banknotes? 

Having broken into a bank vault, do you take the banknotes or the gold? Assume 

that your capacity to carry loot is limited by mass rather than by volume. 

a. Estimate gold's value density (monetary value per mass) — for example, in dol- 
lars per gram. Give plausible ranges for your subestimates and find the result- 
ing plausible range for the value density. 

b. For your favorite banknote, give your plausible range for its value density and 
for the ratio 

value density of gold 
value density of the banknote 

c. Should you take the gold or the banknotes? Use a table of the normal distribu- 
tion to evaluate the probability that your choice is correct. 





7.3 Random walks: Viscosity and heat flow 



249 



7.3 Random walks: Viscosity and heat flow 

The great mathematician George Polya, later to become author of How to 
Solve It [38], was staying at a bed and breakfast in his adopted country of 
Switzerland and walking daily in the garden between its tall hedge rows. 
He kept running into a newlywed couple taking their walk. Were they fol- 
lowing him, or was it a mathematical necessity? From this question was 
born the study of random walks. 

Random walks are everywhere. In the card game War, 
cards wander between two players, until one player 
gets the whole deck. How long does the game last, on 
average? A molecule of neurotransmitter is released 
from a synaptic vesicle. It wanders in the 20-nanome- 
ter gap, the synaptic cleft, until it binds to a muscle 
cell; then your leg muscle twitches. How long does the molecule's jour- 
ney take? On a winter day, you stand outside wearing only a thin layer of 
clothing, and your body heat wanders through the clothing. How much 
heat do you lose? And why do large organisms have circulatory systems? 
Answering these questions requires understanding random walks. 

7.3.1 Behavior of random walks: Lumping and probabilistic reasoning 

For our first random walk, imagine a perfume molecule wandering in a 
room, moving in a straight line until collisions with air molecules deflect it 
in a random direction. This randomness reflects our incomplete knowledge: 
Knowing the complete state of the colliding molecules, we could calculate 
their paths after the collision (at least, in classical physics). However, we do 
not have that knowledge and do not want it! 

Even without that information, the random motion of one molecule is still 
complicated. The complexity arises from the generality — that the direction 
of travel and the distance between collisions can have any value. To simplify, 
we'll lump in several ways. 

Distance. Let's assume that the molecule travels a typical, fixed distance 
between collisions. This distance is the mean free path A. 

Direction. Let's assume that the molecule travels only along coordinate axes. 
Let's also study only one-dimensional motion; thus, the molecule moves 
either left or right (with equal probability). 
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Time. Let's assume that the molecule travels at a typical, fixed speed v and 
that every collision happens at regularly spaced clock ticks. These ticks are 
therefore separated by a characteristic time r = Ajv. This time is called the 
mean free time. 

In this heavily lumped, one-dimensional model, a molecule starts at an ori- 
gin ( x = 0) and wanders along a line. At each tick it moves left or right with 
probability 1 /2 for each direction. 

As time passes, the molecule spreads out. Actually, the molecule itself 
does not spread out! It has a particular position, but we just don't know 
it. What spreads out is our belief about the position. In the notation of 
probability theory, this belief is a set of probabilities — a probability distrib- 
ution — based upon our knowledge of the molecule's starting position: 

Pr (molecule is at position x at time 1 1 it was atx = 0atf = 0). (7.26) 

The changing beliefs are represented by a sequence of probability distribu- 
tions, one for each time step. For example, at 2r (after two ticks), the mole- 
cule has probability 1 / 2 of being at the origin, by either going right then left 
or left then right. At 3 t, it has probability 0 of being at the origin (why?), 
but probability 3/8 of being at x = +A. To quantify the spread, which is 
shown in the following figure, we need an abstraction and a notation. 

P = 1 

1 i i 

2 2 2 

T T 1 1 

I I 

1 1 

0 -A +A — 2A 0 +2A 

At t - 0 At t = r At t = 2t 

The position of the molecule will be x. Its expected position will be (x). 
The expected position is the weighted average of the possible positions, 
weighted by their probabilities. Both x and (x) are functions of time or, 
equivalently, of the number of ticks. However, because motion in each direc- 
tion is equally likely, the expected position does not change (by symmetry). 
So (x), which starts at zero, remains zero. 

A useful measure is the squared position x 2 — more useful because it is 
never negative, making moot the symmetry argument that made (x) = 0. 
Analogous to (x), the expected or mean squared position (x 2 ) is the average 
of the possible values of x 2 , weighted by their probabilities. 



3 3 

8 8 

1 1 

8 8 

T T 

— 3A -A +A +3A 
At t = 3t 
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Let's see how ( x 2 ) changes with time. At f = 0, the only possibility is x = 0, 
so <x 2 ) t _o = 0. After one clock tick, at t = r, the possibilities are also 
limited: x = +A or —A. In either case, x 2 = A 2 . Therefore, (x 2 ) t=T = A 2 . 

It maybe the mark of a savage, in Polya's phrase [37], to generalize a pattern 
from only two data points. So let's find t = 2r and then guess a pattern. At 
t = 2t, the position x could be — 2A, 0, or A, with probabilities 1/4, 1/2, and 
1/4, respectively. The weighted average (x 2 ) is 2A 2 : 

(x 2 ) = I x (-2A) 2 + i x (OA) 2 + | x (+2A) 2 = 2A 2 . (7.27) 

The pattern seems to be 

<x 2 ) (= „ t = nA 2 . (7.28) 



This conjecture is correct. (You can test it at t = 3r and 4r in Problem 7.13.) 
Each step contributes the squared step size A 2 to the squared spread (x 2 ). 
We saw this pattern in Section 7.2.1, when we combined plausible ranges. 
The half width of the plausible range for an area A = hw was given by 

*1 = a h + *W' < 7 - 29 > 



where v x is the half width of the plausible range for the quantity x. The 
half widths are step sizes in a random walk — random because the estimate 
is equally like to be an underestimate or an overestimate (representing step- 
ping left or right, respectively). Therefore, the half widths, like the step 
sizes in a random walk, add via their squares ("adding in quadrature"). 

The number of ticks is n = f/r, so (x 2 ), which is nA 2 , is also (A 2 / t . Thus, 



(x 2 ) _ A 2 

t t ' 



(7.30) 



As time marches on, (x 2 )/t remains A 2 /t! The invariant A 2 / r is all that we 
need to know about the details of a random walk. 



This abstraction is known as the diffusion con- 
stant. It is usually denoted D and has dimen- 
sions of L“T~ 1 . The table gives useful approxi- 
mate diffusion constants for a particle wander- 
ing in three dimensions. In d dimensions, the 
diffusion constant is defined with a dimension- 
less prefactor: 





D(m 2 s J ) 


air molecules in air 


1.5 x 10 -5 


perfume molecules in air 


1CT 6 


small molecules in water 


1CT 9 


large molecules in water 


KT 10 



D = 



1 A 2 
d t ' 



(7.31) 
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Because A/r is the speed v of the molecule between its randomizing colli- 
sions, a useful alternative form for estimating D is 
1 

D = -Av. (7.32) 

d 



Problem 7.13 Testing the random-walk dispersion conjecture 

Make the probability distribution for the particle at t = 3t and t = 4t, and com- 
pute each (x 2 ). Do your results confirm that (x 2 ) t=nT = nA 2 ? 

Problem 7.14 Higher dimensions 

For a molecule starting at the origin and wandering in two dimensions, (r 2 > = nA 2 , 
where r 2 = x 2 + y 2 . Confirm this statement for t = 0...3t. 



A random and a regular walk are analogous in having an invariant. For 
a regular walk, it is ( x)/t: the speed. For a random walk, it is (x 2 )/t: the 
diffusion constant. (The diffusion constant is to a random walk as the speed 
is to a regular walk.) However, the two walks differ in a scaling exponent. 
For a regular walk, (x) oc t: The scaling exponent connecting position and 
time is 1. For a random walk, we'll use the rms (root-mean-square) position 

X rms = Jix 2 ) (7.33) 



► 



as a measure analogous to (x) for a regular walk. Because (x 2 ) oc f, the 
rms position x rms is proportional to t 1/2 : In a random walk, the scaling ex- 
ponent connecting position and time is only 1 /2. This scaling exponent has 
profound effects on heat, drag, and diffusion. 



As an example of the effect, let's apply our knowl- 
edge of diffusion and random walks to a familiar sit- 
uation. Across a room someone opens a bottle of per- 
fume or, if your taste in problems is pessimistic, a 
lunch of leftover fish. 




room size (L) 



How long until odor molecules reach your nose? 



As time passes, the molecule wanders farther afield, with its rms position 
growing proportional to f 1/2 . As a lumping approximation, imagine that 
the molecule is equally likely to be anywhere within a distance x rms of the 
source (the perfume bottle or leftover fish). For the molecule to have a sig- 
nificant probability to be at your nose, x rms should be comparable to the 
room size L. Because (x 2 ) = Lit, the condition is L 2 ~ Dt, and the required 
diffusion time is t ~ L 2 /D. (For another derivation, see Problem 7.16.) 
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For perfume molecules diffusing in air, D is roughly 10 6 square meters per 
second. For a 3-meter room, the diffusion time is roughly 4 months: 

7 2 pa m ~)2 

t ~ — — - — - w 10 7 s « 4 months. (7.34) 

D 10 -6 m 2 s -1 



This estimate does not agree with experiment! After perhaps a minute, 
you'll notice the aroma, whether perfume or fish. Diffusion is too slow to ex- 
plain why the odor molecules arrive so quickly. In reality, they make most 
of the journey using a regular walk: The small, unavoidable air currents 
in the room transport the molecules much farther and faster than diffusion 
can. The speedup is a consequence of the change in scaling exponent, from 
1 / 2 (for random walks) to 1 (for regular walks). 



Problem 7.15 Diffusion constant of air 

Estimate the diffusion constant for air molecules diffusing in air by using 
1 

D ~ - x mean free path x travel speed 



(7.35) 



and the mean free path of air molecules (Section 6.4.5). This value is also its ther- 
mal diffusivity x air and its kinematic viscosity v air ! 



Problem 7.16 Dimensional analysis for the diffusion time 

Use dimensional analysis to estimate the diffusion time t based on L (the relevant 
characteristic of the room) and D (the characteristic of the random walk). 



A similar estimate explains the existence of circulatory systems. Imagine 
an oxygen molecule diffusing through our body to a muscle cell, where 
its services are needed to burn glucose and produce energy. The diffusion 
distance (our body size) is L ~ 1 meter. The diffusion constant for an oxygen 
molecule in water (a small molecule in water) is roughly 10 -9 square meters 
per second. The diffusion time is roughly 10 9 seconds or 30 years: 

1 ~ % ~ 10 -9^2 S -t = 109 s * 30 y ears - < 7 - 36 > 



Over long distances — long compared to the mean free path A — diffusion 
is a slow method of transport! Large organisms, especially warm-blooded 
organisms with high metabolic rates, need another solution: a circulatory 
system. It transports oxygen much more efficiently than diffusion can, just 
as air currents do for perfume. The circulatory system, a branching network 
of ever-smaller capillaries, ends once the typical distance between the small- 
est capillaries and a cell is small enough for diffusion to be efficient. 
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Another biological example of short-distance diffusion is the gap between 
two neighboring neurons, called the synaptic cleft. Its width is only L ~ 
20 nanometers. Signals between neighboring nerve cells, or between a neu- 
ron and a muscle cell, travel chemically, as neurotransmitter molecules. 



Let's estimate the diffusion time for a neurotransmitter molecule. A neuro- 
transmitter is a large molecule, and it is diffusing in water, so D ~ 10 -10 
square meters per second. Its diffusion time is 4 microseconds: 

L? ~ (2 x 10~ 8 m) 2 
D 10 -10 m 2 s -1 



t ~ =- 



= 4 x 10 6 s. 



(7.37) 



Without a comparison, this time means little: We cannot say right away 
whether the time is long or short. However, because it is much smaller than 
the spike-timing accuracy of neurons (about 100 microseconds), the time to 
cross the synaptic cleft is small enough not to affect nerve-signal propaga- 
tion. For transmitting signals between neighboring neurons, diffusion is an 
efficient and simple solution. 

Problem 7.17 Probability of being at the origin 

Polya's analysis of his encounters with the newlywed couple required first finding 
the probability p n that a random walker is at the origin after n clock ticks (p 0 = 1). 
For a d-dimensional random walk, find the scaling exponent /3(d) in p n <x nP d K 

Problem 7.18 Expected number of visits to the origin 

Using your result from Problem 7.17, estimate the expected number of visits a 
random walker in one and two dimensions makes to the origin (summed over all 
ticks n > 0). Thereby explain Polya's theorem [36], that a random walker in one 
or two dimensions always returns to the origin. What makes a three-dimensional 
random walk different from one or two dimensions? 



7.3.2 Types of diffusion constants 

Because random walks are everywhere, there are several kinds of diffusion 
constants. They are named differently depending on what is diffusing, but 
they share the mathematics of the random walk. Therefore, they all have 
dimensions of length squared per time (L 2 T -1 ). 



what is diffusing 


name of diffusion constant 


symbol 


particles 


diffusion constant 


D 


energy (heat) 


thermal diffusivity 


K 


momentum 


kinematic viscosity 


V 
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A handful of useful diffusion constants (for particles) were tabulated on 
page 251. To complement that table, here are a few useful, approximate 
thermal diffusivities and kinematic viscosities. 



^air 


heat diffusing in air 


1.5 x 10 -5 m 2 s -1 


hdr 


momentum diffusing in air 


1.5 x 10 -5 


^water 


heat diffusing in water 


1.5 x 10 -7 


^water 


momentum diffusing in water 


l.OxlCr 6 



In air, all three diffusion constants — D for molecules, K for energy, and v for 
momentum — are roughly 1.5 x 10 -5 square meters per second. Their simi- 
larity is no coincidence. The same mechanism (diffusion of air molecules) 
transports molecules, energy, and momentum. 

In water, however, the molecular diffusion constant D is several orders of 
magnitude smaller than the heat- and momentum-diffusion constants ( k 
and v , respectively). Even the momentum- and heat-diffusion constants 
differ roughly by a factor of 7. This dimensionless ratio, v / k, is the Prandtl 
number Pr. For water, I usually remember v , because it is just a power of ten 
in SI units, and remember the Prandtl number — the lucky number 7 — and 
use these values to reconstruct k. 

7.3.3 Thermal diffusivities of liquids and solids 

The large discrepancy between the molecular and thermal diffusion con- 
stants in water indicates that our model for diffusion in water is not com- 
plete. The problem is not limited to water. If we had made a similar com- 
parison for any solid, comparing the molecular and thermal diffusion con- 
stants (D and k), the discrepancy would have been even larger. 

Indeed, in liquids and solids, in contrast to gases, heat is not transported 
by molecular motion. In a solid, the molecules sit at their sites in the lattice. 
They vibrate but scarcely wander. In a liquid, molecules wander but only 
slowly. Their tight packing keeps the mean free path short and the diffusion 
constant small. Yet, as everyday experience and the large k/D ratio suggest, 
heat can travel quickly in liquids and solids. The reason is that heat is trans- 
ported by miniature sound waves rather than by molecular motion. The 
sound waves are called phonons. 

By analogy to photons, which represent the vibrations of the electromag- 
netic field, phonons represent the vibrations of the lattice entities (the atoms 
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or molecules of the liquid or solid). One molecule vibrates, shaking the 
next molecule, which shakes the next molecule. This chain is the motion of 
a phonon. Phonons act like particles: They travel through the lattice, bounc- 
ing off impurities and other phonons. Like ordinary particles, they have a 
mean free path and a propagation speed. These properties of their random 
walk determine the thermal diffusivity: 



K 



1 

3 



x phonon mean free path x propagation speed. 



(7.38) 



The propagation speed is the speed of sound c s , because phonons are tiny 
sound waves (familiar sound waves contain many phonons, just as light 
beams contain many photons). Sound speeds in liquids and solids are 
much higher than the thermal speeds, so we already see one reason why k, 
the diffusion constant for heat, is larger than D, the diffusion constant for 
particles. 

The mean free path A measures how far a phonon travels before bouncing 
(or scattering) and heading off in a random direction. Let's write A = fin, 
where n is the typical lattice spacing (3 angstroms) and fi is the number of 
lattice spacings that the phonon survives. 



Then the thermal diffusivity becomes 




c s pa. 



(7.39) 



For water, our favorite substance, c s ~ 1.5 kilometers per second (as you 
estimated in Problem 5.58). Then the predicted thermal diffusivity becomes 

K W ater ~ /5 X 1.5 X 10 -7 m 2 S -1 . (7.40) 

Because the actual thermal diffusivity of water is 1.5 x 10 -7 square meters 
per second, our estimate is exact if we use fi = 1. This choice is easy to 
interpret and remember: In water, the phonons travel roughly one lattice 
spacing before scattering in a random direction. This distance is so short 
because water molecules do not sit in an ordered lattice. Their disorder 
provides irregularities that scatter the phonons. (At the same time, this 
mean free path is much larger than the mean free path of tightly packed 
atoms or molecules, which move a fraction of an angstrom before getting 
significantly deflected. Therefore, even in a liquid, k is much larger than 
the molecular diffusion constant D.) 

To estimate k for a solid, let's use /c watcr along with the scaling relation 

K ex Ac s . (7.41) 
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In a typical solid, the sound speed c s is 5 kilometers per second — roughly a 
factor of 3 faster than in water. The mean free path A is also longer than in 
the totally disordered lattice of a liquid. In a solid without too many lattice 
defects, and at room temperature, a phonon travels a few lattice spacings 
before scattering — compared to just one lattice spacing in water. 

These two differences, each contributing a factor of 3, make the typical ther- 
mal diffusivity of a solid a factor of 10 larger than that of water: 

K soiid ~ K water X 10 « 1.5 x 10“ 6 m 2 s" 1 . (7.42) 

Rounded to 10 -6 square meters per second, this value is our 
canonical thermal diffusivity of a solid — for example, sand- 
stone or brick. The table also shows a new phenomenon: 

For metals, k is much larger than our canonical value. Al- 
though a small discrepancy could be explained by a few 
missing numerical factors, this significant discrepancy indi- 
cates a missing piece in our model. 

Indeed, in metals, heat can also be carried by electron waves, 
not just phonons (lattice waves). Electron waves travel much 
faster and farther than phonons. Their speed, known as 
the Fermi velocity, is comparable to the orbital speed of an 
atom's outer-shell electron. As you found in Problem 5.36, 
for hydrogen this speed is me, where m is the fine-structure constant (~ 10 -2 ) 
and c is the speed of light. This speed, roughly 1000 kilometers per second, 
is much faster than any sound speed! As a result, the thermal diffusivity in 
metals is large. As you can see in the table, for a good conductor, such as 
copper or gold, K ~ 10 -4 square meters per second. 

7.3.3. 1 Heating a skillet 

To feel a thermal diffusivity, place a thin cast-iron skillet on a hot stove. 
How long does it take for the top surface to feet hot? 

The hot stove supplies blobs of heat (of energy) that wander back and forth: 
The heat blobs perform a random walk. In a random walk, a particle with 
diffusion constant D wandering for a time t reaches a distance z ~ f~Dt ; 
we used this lumping model in Section 7.3.1 to estimate the diffusion time 
across a synaptic cleft. Here, the particle is a blob of heat, so the diffusion 
constant is k. Thus, the hot front reaches a distance z ~ fid into the skillet. 



k (m 2 s *) 



Au 


1.3 x 10 -4 


Cu 


l.ixicr 4 


Fe 


2.3 x 10 -5 


air 


1.9 x 10 -5 


sandstone 


l.ixicr 6 


brick 


0.5 x 10 -6 


glass 


3.4 x 10 -7 


water 


1.5 x 10 -7 


pine 


0.9 x 10 -7 
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In this lumping picture, the temperature profile 
is a rectangle with a moving right edge — repre- 
senting the heat wave moving upward and into 
the skillet. For my 1 -centimeter- thick cast-iron 
skillet, the hot front should reach the top of the 
skillet in roughly 4 seconds: 

, hi (io- 

/C Fe 



~ 2 m) 2 
2.3 x 10 -5 m 2 s -1 



4 s. 



( 7 . 43 ) 



1 ^stove 




1 hot zone 


► | 


1 


Troom 1 



z = 0 



\J~Pi 



z — L 



Don't try the following experiment at home. But for the sake of lumping 
and probabilistic reasoning, I set our flattest cast-iron skillet on a hot electric 
stove while touching the top surface. After 2 seconds, my finger involuntar- 
ily jumped off the skillet. 

The discrepancy of a factor of 2 between the predicted 
and actual times is not bad considering the simplic- 
ity, or crudity, of the lumping approximations that it 
incorporates. However, there is a bigger discrepancy. 

The model predicts that, for the first 4 seconds, the top 
of the skillet remains at room temperature. One feels 
nothing until, wham, the full temperature of the stove 

However, experience suggests that the skillet's tem- 
perature starts rising before the skillet becomes too 
hot to touch, and then it monotonically approaches 
the stove temperature — as sketched in the figure. 

One reason for this discrepancy could be the skillet's 
top surface. In our model, the skillet has only a bot- 
tom surface and is infinitely thick. The top surface might alter the heat 
flow. However, the infinite-slab assumption isn't the fundamental problem 
(correcting the assumption turns out to speed up the heating process by a 
factor of 2). Even if we fix it, the model would still make the bogus predic- 
tion of a sudden jump to the stove temperature. It's hard to believe after 
the exhortations on the power of lumping, but we have lumped too much. 

To improve the model, let's incorporate a more realistic temperature pro- 
file. Beyond the canonical lumping shape of a rectangle, the next simplest 
shape is a triangle (the integral of a rectangle). We will therefore replace 
the rectangular temperature profile with a triangle having the same area as 
the rectangle. Because of the factor of 1/2 in the area of a triangle, the hot 
zone now extends to 2 JkF instead of to / xt . 



f stove 
_Jhot_ 
Boom 



feels very hot 



7(f) 



place 
on stove 

hits at 4 seconds. 

Ttove 




place 
on stove 



skillet top 
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The area of the triangle is proportional to the heat 
transferred to the skillet. By matching the trian- 
gle's area to the rectangle's area, we preserve this 
integral quantity. When making lumping mod- 
els, preserving integral quantities is usually more 
robust than is preserving differential quantities 
(such as slope). 



i 




Ttove 



In the triangular-lumping model, you feel 
nothing until the tip of the triangle reaches 
the top of the skillet. Because the triangu- 
lar hot front extends a factor of 2 farther 
than the rectangular hot front (2 Jxt ver- 
sus -[id ), and because diffusion times are 
proportional to distance squared, the re- 
quired time falls by a factor of 4. Thus, it 
falls from 4 seconds to 1 second. At 1 sec- 
ond, the hot front arrives at the top of the skillet, which starts to feel warm. 
Then the temperature slowly increases toward the stove temperature. This 
next-simplest model makes quite realistic predictions! 



4k 



t (t) 




Problem 7.19 Cooling the Moon 

How long does it take for the Moon, with a radius of 1.7 x 10 6 meters to cool sig- 
nificantly by heat diffusion through rock? Given that the Moon is now cold, what 
do you conclude about the mechanism of cooling? 

Problem 7.20 Diffusion with a bit of drift: Breaking the bank at Monte Carlo 

You can play the card game blackjack such that your probability of winning a hand 
is p — 0.51 and of losing is 1 — p = 0.49. You start with N betting units, the 
stake; and you bet 1 unit on each hand. The goal of this problem is to estimate the 
threshold N such that you are more likely to break the bank than to lose the stake. 

Let x n be your balance after the //th hand. Thus, x 0 — N. Losing your stake (the 
N units) corresponds to x = 0. To estimate N, extend the random-walk model to 
account for drift: that the probabilities of moving left and right are not equal. 

a. What is the symbolic expression for breaking the bank? 

b. Sketch your expected balance (x) versus the number of hands ;/ (on linear axes). 

c. Sketch, on the same axes, the dispersion x rms versus n . 

d. Explain graphically "a significant probability of breaking the bank." 

e. Thus, estimate the required stake N. 
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7. 3. 3. 2 Baking 

A skillet on a stove gets heated from one side. An equally important kind of 
cooking, and one that helps us practice and extend our lumping model of 
random walks, is heating from two sides: baking. As an example, imagine 
baking a slab of fish that is L = 1 inch (2.5 centimeters) thick. 

How long should it bake in the oven? 



A first, and quick, analysis predicts L 2 /k, the characteristic time for heat 
diffusion, where k is the thermal diffusivity of water (organisms are mostly 
water). However, this simple model predicts an absurd time: 

, L 2 (2.5 cm) 2 



v water 



1.5x10 3 cm 2 s 



70 minutes. 



(7.44) 



After more than an hour in the oven, the fish will be so dry that it might 
catch fire, never mind not being edible. The model also ignores an impor- 
tant quantity: the oven temperature. Fixing this hole in the model will also 
improve the time estimate. 



Why does the oven temperature matter? 



The inside of the fish must get cooked, which means that the proteins dena- 
ture (lose their folded shape) and the fats and carbohydrates change their 
chemistry enough to become digestible. This process happens only if the 
food gets hot enough. Thus, a cold oven could not cook the fish, even after 
the fish reached oven temperature. What temperature is hot enough? From 
experience, a thin piece of meat on a hot skillet (at, say, 200 °C) cooks in less 
than a minute. At the other extreme, if the skillet is at 50 °C, unpleasantly 
hot to the touch but not much hotter than body temperature, the meat never 
cooks. A round, intermediate temperature of 100 °C, enough to boil water, 
should be enough to cook meat thoroughly. 

If we set the oven to, say, 180 °C, the fish will then be 
cooked once its center reaches the midpoint of the room 
(20 °C) and oven temperatures, which is 100 °C. In this 
improved model of cooking, the interior of the fish starts 
at T room ~ 20 °C, and the oven holds the top and bottom surfaces of the fish 
at T oven = 180 °C (360 °F). With this model, and using triangular tempera- 
ture profiles, we will estimate the time required for the center of the fish to 
reach the midpoint temperature of 100 °C. 



oven at 180 C 
fish starts at 20 °C 
oven at 180 °C 
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Two triangular hot zones, one from each surface, move 
toward the center of the fish. Before the zones meet, the 
center of the fish, at z = L/2, is cold (room temperature). 
The fish, unless it is very fresh, is not ready to eat. But 
soon the triangles will meet. 




Each triangle extends a distance 2 fid. They first meet 
(at x = L/2) when 2 fid = L/2. Thus, f meet ~ L 2 /16k. 
From that moment, the center warms up as the triangle 
fronts overlap ever more. The fish is cooked when the cen- 
ter reaches the mean of the room and oven temperatures 
(which is 100 °C). 




In dimensionless temperature units, where T room corre- 
sponds to T = 0 and T oven to T = 1, the cooking crite- 
rion is T = 1/2. Each triangle wave therefore contributes 
T = 1/4. The left triangle, representing the hot front in- 
vading from the bottom surface, then passes through the 
points (z = 0, T = 1) and (z = L/2, T = 1/4). It reaches the z axis (T = 0) 
when z = 2L/3. The corresponding triangle-zone diffusion time is given by 
2 4ict = 2L/3, whose solution is 




a 

C 



t 



cooked 



L? 

9 K 



(7.45) 



For our 2. 5-centimeter- thick fish, the time is roughly 7 minutes: 



(2.5 cm) 2 



1 



9 1.5 x 10 -3 cm 2 s -1 



7 minutes. 



(7.46) 



This estimate is reasonable. My experience is that a fish fillet of this thick- 
ness requires about 10 minutes in a hot oven to cook all the way through 
(and further baking only dries it out). 

Problem 7.21 Baking too long 

In the model of two approaching triangle hot fronts, the central temperature starts 
to rise at t ~ L 2 /16k, and it reaches the halfway temperature at f coo k e d ~ L 2 /9 k . 
When does it reach the oven temperature? Sketch the central temperature versus 
time, labeling interesting values. 

Problem 7.22 Cooking an egg 

Baking a 6-kilogram turkey requires, from experience, 3 to 4 hours (you get to 
predict this time in Problem 7.34). Use proportional reasoning to estimate the time 
required to boil an egg. 
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7.3.4 Boundary layers 

In cooking, the hot zone diffuses inward from the hot surface. For the most 
tangible example of this random walk, rub your finger on the blade of a (sta- 
tionary!) window fan. Your finger comes away dusty and leaves a dust-free 
streak on the blade. But why is any dust on the blade at all? When the fan 
was turning, why didn't the air streaming by the blade blow off the dust? 

The answer lies in the concept of the boundary layer. For the cooking ex- 
amples (Sections 7.3.3. 1 and 7.3. 3. 2), this layer is the expanding hot zone. It 
arises from the boundary constraint (the temperature), which diffuses into 
the skillet or the fish. For the fan blade, the analogous constraint is that, next 
to the blade, the fluid has zero velocity with respect to the blade. The con- 
dition, called the no-slip boundary condition, has the justification that the 
fluid molecules next the surface get caught by the inevitable roughness at 
the surface. (For a historical and philosophical discussion of the subtleties 
of this boundary condition, see Michael Day's article on "The no-slip con- 
dition of fluid dynamics" [8].) 

Starting at the blade surface, a zero-speed or, equivalently, zero-momentum 
zone diffuses into the fluid — just as the stove temperature diffuses into the 
skillet or the oven temperature into the fish fillet. After a growth time f, 
the zero-momentum front has diffused a distance <5 ~ Jvt, where v is the 
diffusion constant for momentum (the kinematic viscosity). The distance S 
is the boundary-layer thickness. Within the boundary layer, the fluid moves 
more slowly than the fluid in the free stream. Using a rectangular lumping 
picture, the fluid speed is zero within the layer and full speed outside it. 
Therefore, dust particles entirely in the layer remain on the blade. 

To estimate the boundary-layer thickness, imagine a window fan with 
a blade width / and rotation speed v at the widest part of the blade. 

The growth time is t ~ l/v. For a generic window fan sweeping out a 
diameter of 0.5 meters, the blade width / may be roughly 0.15 meters. 

If the fan rotates at roughly 15 revolutions per second or co ~ 100 ra- 
dians per second, the blade speed v is 10 meters per second: 

0.1m x 100 rads -1 = 10 ms -1 . (7.47) 

arc radius co 




0.5 m 



At this speed, the growth time t is 0.015 seconds. In that time, the zero-mo- 
mentum constraint diffuses roughly 0.5 millimeters from the fan blade: 




x 



0.05 cm. 



(7.48) 
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Dust particles that are significantly smaller than 0.5 millimeters feel no air 
flow (in this simplest lumping picture) and thus remain on the fan blade un- 
til your finger rubs them off. For the same reason (boundary layers), simply 
rinsing dirty dishes in water will not remove the thin layer of food next to 
the surface. Proper cleaning requires scrubbing (a sponge) or soap (which 
gets underneath the food oils). 

Problem 7.23 Reynolds number in the boundary layer 

Based on the boundary-layer thickness 5 ~ 4vt, estimate the Reynolds number in 
a boundary layer on an object of size L (a length) moving in a fluid at speed v. 

Problem 7.24 Turbulence in the boundary layer 

At Reynolds numbers comparable to 1000, flows usually become turbulent. Use 
Problem 7.23 to estimate the main-flow Reynolds number at which the boundary 
layer becomes turbulent. For a smooth ball similar in size to a golf ball, what flow 
speed is required? Suggest an explanation for the dimples on golf balls. 



7.4 Transport by random walks 

When a hot-oven temperature front diffuses into a slab of fish or the no-slip, 
zero-momentum front diffuses into a fluid, it comes with a heat or momen- 
tum flow. With our random-walk model, we can estimate the magnitude 
of these flows, and thereby understand the drag forces on fog droplets and 
bacteria (Problem 7.26) and why we feel cold on a winter day without thick 
clothing (Section 7.4.4). 



7.4.1 Diffusion speed 



The essential property of a random walk is that the distance traveled is not 
proportional to the time, as it would be in a regular walk, but rather to 
the square root of the time. This change in scaling exponent means that 
the speed at which heat, momentum, or particles diffuse depends on the 
diffusion distance. When the distance is L, the diffusion time is t ~ L 2 /D, 
where D is the appropriate diffusion constant. Thus, the transport speed is 
comparable to D/L: 



L L _ D 
t ~ L 2 /D ~ V 



(7.49) 



This speed depends inversely on the distance. This scaling is consistent 
with our calculations showing that diffusion is a terribly slow means of 
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transport over long distances (for example, for perfume molecules diffusing 
across a room) but fast over short distances (for example, for neurotransmit- 
ter molecules diffusing across a synaptic cleft). 

When the diffusing quantity is momentum, the appropriate diffusion con- 
stant is v , and the diffusion speed is v/L. Thus, the Reynolds number, 
% ow L/ v , is the ratio u flow /^diffusion — for the same reason that it is the ratio 
of times f diffusion /f flow (as you will find in Problem 7.32). Using the diffusion 
speed, we can estimate fluxes and flows. 



7.4.2 Flux 

Transport is measured by the flux: 
flux of stuff 



amount of stuff 



(7.50) 



area x time 

As we found in Section 3.4.2, the flux is also given by 

flux of stuff = S [ U ^ x transport speed, 
voiume 

When the stuff travels by diffusion, the transport speed is the diffusion 
speed D/L (from Section 7.4.1). The resulting flux is 



(7.51) 



flux of stuff 
In symbols, 

F 



stuff diffusion constant 



volume 



distance 



D 

n L' 



(7.52) 



(7.53) 



g a P 



where n is the concentration (stuff per volume), D is the appropriate diffu- 
sion constant (depending on what is diffusing), and L is the distance. 

An important application is diffusion across a gap. The gap could 
be a synaptic cleft, with a gap width L ~ 20 nanometers and 
with different neurotransmitter concentrations on its two sides. 

Or it could be a shirt (L ~ 2 millimeters) with different tempera- 
tures — concentrations of energy — on the inside and outside. On 
one side, the density of stuff is ;q; on the other side it is n 2 . Then 
there are two fluxes in the gap, left to right and right to left: 




n 2 



l->2 



2— >1 



D 

?h r 

D 

” 2 l- 



(7.54) 
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The net flux F is their difference: 

F~ <»*-»,)£ =D^. 



(7.55) 



The concentration difference ?z 2 — divided by the gap size L is an impor- 
tant abstraction: the concentration gradient. It measures how rapidly the 
concentration varies with distance. Using it, the flux becomes 

flux = diffusion constant x concentration gradient. (7.56) 



This result, called Fick's law, is exact (hence the equals sign instead of ~). 
In calculus form, the concentration gradient is An /Ax, so 



F = 




(7.57) 



If the diffusing stuff is particles, then the appropriate diffusion constant is 
just D, and the result can be used as is. If the diffusing stuff is momen- 
tum, then the diffusion constant is the kinematic viscosity v, and the flux 
is closely connected to a drag force (Problem 7.25). 

Problem 7.25 Momentum flux produces drag 

If the diffusing stuff is momentum, then the diffusion constant is the kinematic 
viscosity v , and the concentration gradient is the gradient of momentum density. 
Thus, explain why Fick's law becomes 

viscous stress = pv x velocity gradient, (7.58) 

where viscous stress is viscous force per area (pv is the dynamic viscosity ;/). 

Problem 7.26 Stokes drag 

In this problem, you use momentum flux (Problem 7.25) to estimate the drag force 
on a sphere of radius r in a flow at low Reynolds number (Re « 1). If Re « 1, the 
boundary layer (Section 7.3.4) — the region over which the fluid velocity changes 
from zero to the free-stream velocity v — is comparable in thickness to r . Using that 
information, estimate the viscous drag force on the sphere. 

If the diffusing stuff is heat (energy), the diffusion constant is the thermal 
diffusivity K, and the concentration gradient is the gradient of energy den- 
sity. Thus, heat flux is 

heat flux = thermal diffusivity x energy-density gradient. (7.59) 

To figure out the meaning of energy-density gradient, let's start with energy 
density itself, which is energy per volume. We usually measure it using tem- 
perature. Therefore, to make the eventual formula applicable, let's rewrite 
energy density in terms of temperature: 
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energy energy 

— = — x temperature. 

volume volume x temperature 

The complicated quotient on the right splits into two simpler factors: 

energy _ mass energy 

volume x temperature volume mass x temperature 

The first factor, mass per volume, is just the substance's density p. 
second factor is called the specific heat c p . The most familiar specific heat is 
water's: 1 calorie per gram per °C. That is, raising 1 gram of water by 1 °C 
requires 1 calorie (« 4 joules). 

Using these abstractions, 
energy 

— ; — 5 = pc„, (7.62) 

volume x temperature 1 



(7.60) 

(7.61) 

The 



To get energy density, or energy per volume, we multiply by temperature 
energy 



volume 



= pc p T. 



(7.63) 



Now that we have energy density, we can find the energy-density gradient. 
Because p and c p are constants (at least for small temperature and position 
changes), any gradient of energy density is due to the temperature gradient 
AT /Ax: 



energy-density gradient = pc p x temperature gradient. (7.64) 

Fick's law, when energy is the diffusing stuff, tells us that 

energy flux = thermal diffusivity x energy-density gradient, (7.65) 



so 



heat (energy) flux = k x pc p x temperature gradient. (7.66) 

energy-density gradient 



The shaded combination pc p K occurs in any heat flow driven by a temper- 
ature gradient. It is a powerful abstraction called the thermal conductivity 



K: 



thermal conductivity = density 

K p 



( specific \ f thermal \ 

\ heat ) \ diffusivity ) ' 



(7.67) 



Thermal conductivity has dimensions of power per length per temperature, 
and is often quoted in units of watts per meter kelvin (W m -1 K -1 ). Using K, 
Fick's law for energy flux in terms of temperature gradient AT /Ax becomes 
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F = 




(7.68) 



where AT is the temperature difference, and Ax is the gap size. 

Now let's estimate a few important thermal conductivities in order to un- 
derstand heat flow around us. 



7.4.3 Thermal conductivity of air 

To estimate our heat loss standing outside on a cold winter's day, we need 
to estimate the thermal conductivity of air. 

Why do we estimate the thermal conductivity of air rather than of clothing? 



The purpose of clothing is to trap air so that heat flows via conduction — that 
is, by diffusion — rather than via the faster process of convection. (If the 
perfume molecules of Section 7.3.1 could be similarly limited to diffusion, 
the perfume aromas would travel very slowly.) 

Because K = pc p K, estimating K splits into three subproblems, one for each 
factor. The density of air p air is just 1.2 kilograms per cubic meter (slightly 
more accurate than 1 kilogram per cubic meter). The thermal diffusivity 
K air is 1.5 x 10 -5 square meters per second. 

The specific heat c p is not as familiar, but we can estimate it. As for water, 
it measures the thermal energy per mass per temperature: 
thermal energy 

C = — . (7.69) 

p mass x temperature 



The thermal energy per particle is comparable to k B T, where k B is Boltz- 
mann's constant, so the energy per temperature is comparable to k B . Thus, 



c 



p 



_Jf_ B_ 
mass 



(7.70) 



Our thermal energy, which is comparable to k B T, is for one particle. The 
corresponding mass in the denominator is the mass of one particle, which 
is an air molecule. 



To convert k B and the mass to human-sized values, we multiply each by Avo- 
gadro's number N A . Then we replace k B N A with the universal gas constant 
R, and mass x N A with the molar mass ?w molar . The result is 

c ^ = r 

p molecular mass x N A m molar ' 



(7.71) 
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where m molai is the molar mass of air. 

This expression is correct except for a dimensionless prefactor. Air is mostly 
nitrogen, for which the prefactor is 7/2. This magic number can be made 
slightly less magical as follows: 



7 

2 



spatial dimensions rotational directions + ^ 



(7.72) 



Each spatial dimension contributes one 1/2 through a term in the transla- 
tional kinetic energy of a nitrogen molecule; thus, the translational piece 
contributes 3/2 to the prefactor. Each rotational direction contributes a 1/2 
through a term in the rotational energy of a nitrogen molecule. Because ni- 
trogen is a linear molecule, there are only two rotational directions, so the 
rotational directions contribute 2/2 to the prefactor. If we stop here, at 5/2, 
we would have the prefactor to find c v , the specific heat while holding the 
volume constant. The final term, +1, accounts for the energy required to ex- 
pand a gas as it is heated and held at constant pressure. Therefore, c p gets 
a prefactor of 7/2. (For a more detailed discussion of the reasoning behind 
this calculation, see the classic Gases, Liquids and Solids and Other States of 
Matter [43, pp. 106].) 

Air is mostly diatomic nitrogen, so m mo i ar is roughly 30 grams per mole. 
Then c p is roughly 10 3 joules per kilogram kelvin: 



R 



7 
— x 



8J 



1 mol 



10 3 J 



p 2 m mo i ar 2 molK 3x10 2 kg kgK 



(7.73) 



„-l 

molar 



Putting together the three pieces. 



fC air « 1.2 kg m 



-3 , 



10 3 Jkg 



-l 



x 1.5x10 5 m 2 s 1 



0.02 



W 

mK' 



(7.74) 



Before using the thermal conductivity, let's try out the specific heat of air on 
an old method of air conditioning. One summer I lived in a tiny Manhattan 
apartment (30 square meters). Summers are hot in New York City, and the 
beautiful people flee for the cooler beach areas — cooler thanks partly to the 
high specific heat of water (Problem 7.27). Because of global warming and 
the old electrical wiring in the apartment building, too old to handle an 
air-conditioning unit, the apartment reached 30 °C at night. A friend who 
grew up before air conditioning suggested taking a wet sheet and using a 
fan to blow air past it. 
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► 



How much does this system cool the apartment? 



As heat from the room air evaporates the water from the sheet, the room 
air cools, as if the room were sweating. The energy required to evaporate 
the water is 



^ ^water^vap' (7.75) 

where m wa ter is the mass of water held by the wet sheet and L vap is the heat 
of vaporization of water. This energy is the demand. 

To lower the room's temperature by AT requires using up a thermal energy 

E ~ p air Vc p AT, (7.76) 

where V is the volume of the room and c p is the specific heat of air. This 
energy is the supply. 

Equating supply and demand gives an equation for AT : 

p aiI Vc p AT ~ m water L vap . (7.77) 



Its solution is 



AT 



■yyi T 

' 1 ^ water ^vap 
Tair^ C p 



(7.78) 



Now we estimate and plug in the needed values. A typical room is roughly 
3 meters high, so the apartment's volume was roughly 100 cubic meters: 

y ~ 30m 2 x 3m w 100m 3 . (7.79) 

area height 



To estimate m water , I imagined that the sheet was about as wet as it would be 
after coming out of the washing machine with its fast spin cycle. That mass 
feels like slightly more than 1 kilogram, so m water ~ 1 kilogram. Finally, 
the heat of vaporization of water is about 2 x 10 6 joules per kilogram (as we 
estimated in Section 1.7.3 using a home experiment). 



Putting in all the numbers, AT is about 20 °C (or 20 K): 



AT 



1 kg x 2 x 10 6 J kg 1 



lkgm 3 x 100 m 3 x 10 3 Jkg X K 1 



= 20 K. 



(7.80) 



This change would have turned the hot 30 °C room into a cold 10 °C room, if 
the cooling had been 100-percent efficient. Because some heat comes from 
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the walls (and from the fan motor), AT will be less than 20 °C — perhaps 
10 °C, leaving the room at a pleasant and sleepable temperature of 20 °C. 
This calculation shows not only that evaporative cooling is a reasonable 
method of air conditioning, but also that our estimate for the specific heat 
of air is reasonable. 

Problem 7.27 Dimensionless specific heat of water 

The specific heat per air molecule is 3.5 so the dimensionless specific heat of air, 
measured relative to is 3.5. What is the dimensionless specific heat of water? 



7.4.4 Keeping warm on a cold day 



Now we have assembled the pieces to understand why 
we dress warmly on a cold day. Our starting point is the 
heat flux: 






(7.81) 



30 °C 
skin 



trapped air 
(shirt) 



Ax 



0°C 

winter 

air 



where AT = T 2 — Tj is the temperature difference across a gap. Ax is the 
gap size, and K is the thermal conductivity of the gap material. Here, the 
gap material is air — the clothing serves to trap the air. 

Let's say that the air outside is at T 1 =0 °C and that skin is at T 2 = 30 °C 
(slightly lower than the internal body temperature of 37 °C). Then AT = 
30 K. Against the advice of your elders, you dress in a thin T-shirt — for 
decency, a very long one. A thin T-shirt has thickness Ax of roughly 2 mil- 
limeters. With these parameters, the heat flux through the shirt becomes 
300 watts per square meter: 

AT ' 

F « 0.02 — x = 300 Wm“ 2 . (7.82) 

mK 2 x 10 -3 m 

Nir Ax 



Flux is power per area, so the energy flow — the power — is the flux times 
a person's surface area. A person is roughly 2 meters tall and 0.5 meters 
wide, with a front and a back, so the surface area is about 2 square meters. 
Thus, the power (the energy outflow) is 600 watts. 



Is this heat loss worrisome? 



Even though 600 might seem like a large number, we cannot therefore con- 
clude that 600 watts is a large heat loss: As we learned in Chapter 5, a 
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quantity with dimensions, such as heat flux, cannot be large or small on 
its own. It needs to be compared to a relevant quantity with the same di- 
mensions. A relevant quantity is a normal human power output. When 
sitting around, a person produces 100 watts of heat; that's our basal meta- 
bolic rate. If 600 watts is escaping through your clothing, you are losing 
heat much faster than the basal metabolism is producing it. No wonder 
you feel so cold on a winter day wearing only a thin shirt and pants. Even- 
tually, your core body temperature falls. Then essential chemical reactions 
in your body slow down, because the enzymes lose their shape optimized 
for body temperature and thereby become less efficient catalysts. Eventu- 
ally you get hypothermia and, if it goes on too long, die. 

One solution is to generate heat to make up the difference: by shivering 
or exercising. Cycling hard, which generates, say, 200 watts of mechanical 
power and another 600 watts of heat (thanks to the one-fourth metabolic 
efficiency), should be vigorous-enough exercise to keep you warm, even 
on a winter day in thin clothing. 

Another simple solution is to dress warmly by putting on thick layers. Let's 
recalculate the power loss if you put on a jacket and thick pants, each 2 cen- 
timeters thick. We could redo the power calculation from scratch, but that 
approach is brute force. It is simpler to notice that the gap thickness Ax 
has increased by a factor of 10, yet nothing else changed. Because flux is 
inversely proportional to the gap size, the flux and the power drop by the 
same factor of 10. Therefore, wearing thick clothing reduces the energy out- 
flow to a manageable 60 watts — comparable to the basal metabolism. As a 
result, your body heat can keep you warm. Indeed, when wearing thick 
clothing, only areas exposed directly to cold air, such as your hands and 
face, feel cold. Those regions are protected by only a thin layer of still air 
(the boundary layer analyzed in Section 7.3.4). 

A thick gap means a small heat flux: When it is cold, bundle up! 

Problem 7.28 Thermal conductivity of helium gas 

Estimate the thermal conductivity of helium at standard temperature and pressure. 
The following fact will help you estimate the mean free path: The density of liquid 
helium is 125 grams per liter. 

Problem 7.29 Comfortable outdoor temperature 

You wear only that long thin T-shirt in which the winter temperature of 0 °C felt 
too cold. Estimate the outside temperature that would feel most comfortable. 
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7.4.5 Getting your clothes wet: More thermal conductivities 



If your thick warm coat gets wet, you feel very cold. Let's use our knowl- 
edge of heat flow to explain why the coat becomes so useless. As we know 
from Section 7.4.4, you feel cold when the dimensionless ratio 



energy flow through your coat 
rate at which your body generates heat 



(7.83) 



is significantly larger than 1. The dry coat kept the energy flow (a power) 
comparable to the rate at which your body generated heat. Wetting the coat 
must increase the energy flow significantly. To see how, let's look again at 
the terms in the energy flow and apply proportional reasoning. 

AT 

energy flow = area x R . (7.84) 

flux 



In comparing the wet to the dry coat, the area is unchanged. The tempera- 
ture difference AT is also unchanged: It is still the 30 °C between skin tem- 
perature and winter-air temperature. The gap Ax, which is the thickness of 
the coat, is also unchanged. 

The remaining possibility is that the thermal conductivity K of the gap ma- 
terial has increased significantly. If so, the thermal conductivity of water 
must be much higher than the thermal conductivity of air. Rather than 
studying water directly, let's first estimate the thermal conductivity of non- 
metallic solids. (Metals have an even higher thermal conductivity, as we 
will discuss after studying water.) Using that estimate, we will estimate 
the thermal conductivity of water. 

The problem is again comparative (proportional reasoning): 
thermal conductivity of nonmetallic solids 

1 1 n ? • (7.85) 

thermal conductivity ot air 



The ratio breaks into three ratios, corresponding to the three factors in the 
thermal conductivity (divide-and-conquer reasoning): 

K = density x specific heat x thermal diffusivity. (7.86) 

P S K 



Rather than using these factors directly, let's remix the first two ( pc p ) into 
a more insightful combination. The specific heat itself is 
energy 



Cp mass x AT ' 



(7.87) 
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where AT is the temperature change. 

Multiplying by the density p points us toward an interpretation of pc p : 

energy energy 



pc P = 



mass 

volume 



mass x AT 



volume x AT ’ 



(7.88) 



On the right side, the ratio of energy to volume can be subdivided as 

energy _ energy / volume V 1 

volume mole V mole ) 

After all these reinterpretations, our remix of pc becomes 



(7.89) 



P° P = 



energy / volume \ 
mole x AT V mole J 



-l 



(7.90) 



The first factor is the molar specific heat; it is usually denoted C p (with a 
capital “C" to distinguish it from the usual, per-mass specific heat c p ). The 
volume per mole is also called the molar volume; it is usually denoted V m . 
Therefore, pc p = C p / V m . Then the thermal conductivity becomes 

„ c p 

K = pc p K = TT~K- (7.91) 

‘'m 

In words, the thermal conductivity K is 
molar specific heat C p 

— x thermal diffusivity K. (7.92) 

molar volume V m 

This remix is more insightful than simply pc p K because C p varies less be- 
tween substances than c p does, and V m varies less between substances than 
p does. The remixed form produces less quantity whiplash, in which the 
product swings wildly up and down as we include each factor. Another 
way to express the same advantage is that C p and V m are less correlated 
than are p and c p ; therefore, the remixed abstractions C p and V m offer more 
insight into the thermal properties of materials than do c p and p. 

Using the remixed form, let's apply divide-and-conquer reasoning and es- 
timate the ratio corresponding to each factor. 

1. Ratio of molar specific heats C p . For air, C p was 3.5 R. For most solids, 
whether metallic or nonmetallic, it is similar: 3 R (where the 3 reflects 
the three spatial dimensions). Thus, this ratio is close to 1. 

2. Ratio of molar volumes V m . For any substance, 1 mole has a mass of 
A grams, where A is the dimensionless atomic mass (roughly, the num- 
ber of protons and neutrons in the nucleus). Because a typical solid 
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density is A grams per 18 cubic centimeters (Section 6.4.1), the molar 
volume of the solid is 18 cubic centimeters per mole. In contrast, for air 
or any ideal gas (at standard temperature and pressure), 1 mole occupies 
22 liters or 22 000 cubic centimeters. Thus, the ratio of molar volumes 
(solid to air) is 18/22 000 or roughly 10 -3 . 

3. Ratio of thermal diffusivities k. From Section 7.3.3, this ratio is roughly 0.1: 
thermal diffusivity of nonmetallic solids 10 -6 m 2 s -1 

thermal diffusivity of air 10 _5 m 2 s _1 ( 7 . 93 ) 

= 0 . 1 . 



The ratio of thermal conductivities is the product of the three ratios. As 
long as we remember that the molar volume appears in the denominator 
(the thermal conductivity is inversely proportional to the molar volume), 
we get a ratio of 100: 



K, 



non metallic solid 

~K~ 



x 1CP x IQ" 1 = 10 2 . 

C p ratio (V m ratio ) -1 K ratio 



(7.94) 



Thus, in contrast to the thermal conductivity of 0.02 watts per meter kelvin 
for air, the typical (nonmetallic) solid has a thermal conductivity of about 
2 watts per meter kelvin. 

Now let's use proportional reasoning to compare this thermal conductivity 
to the thermal conductivity of water, which is the gap material in your wet 
coat. We'll estimate K water / K nonmet aiiic solid- The comparison has the same 
three ratios: molar specific heat, molar volume, and thermal diffusivity. 



1 . Ratio of molar specific heats C p . We can find the specific heat of water from 
the definition of a calorie, as the energy required to raise 1 gram of water 
by 1 degree (celsius or kelvin). Thus, the regular specific heat is simple 
to state: 



r water 

c p 



1 cal 

W 



(7.95) 



The resulting molar specific heat is 72 joules per mole kelvin: 



£ water 



leal 4J 18 g 72 J 

x x — — 

gK leal mol molK’ 



(7.96) 



^molar 



The dimensionless specific heat C™ atei /R is therefore approximately 9 
(as you found in Problem 7.27): 
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C 



water 

P 



R 



72 J mol" 1 K" 1 
8Jmol _1 K _1 



(7.97) 



For a nonmetallic solid, the dimensionless specific heat was only 3; it is 
a factor of 3 smaller than for water. Water stores heat very efficiently, 
which is why it is used as coolant fluid and why coastal weather is 
milder than inland weather. The ratio of molar specific heats is 3. 

2. Ratio of molar volumes V m . The molar volume of water, 18 cubic centime- 
ters per mole, is identical to the canonical molar volume of a solid. Thus, 
the molar- volume ratio is 1. 



3. Ratio of thermal dijfusivities k. As we found in Section 7.3.3 by comparing 
phonon mean free paths and propagation speeds, the thermal diffusiv- 
ity K water is roughly a factor of 10 smaller than the thermal diffusivity of 
a typical solid fl(T ' compared to 10 -6 square meters per second). Thus, 
the thermal-diffusivity ratio contributes a factor of 0.1. 



These three ratios result in the following comparison: 



K, 



water 



K, 



nonmetallic solid 



« X X or « 0.3. 

C p ratio (V m ratio ) -1 K ratio 



In absolute terms. 



K, 



water 



0.3 



2 W 

X mK . ~ 
I Y 

^nonmetallic solid 



0.6 W 
mK ' 



(7.98) 



(7.99) 



This conductivity is a factor of 30 larger than K air . As a result, wearing wet 
clothes on a cold day is so unpleasant and can even be dangerous. Thick 
clothing (a coat) allowed a comfortable 60 watts of heat flow — a factor of 10 
lower than the T-shirt allowed. Wetting the thick coat increases the thermal 
conductivity by a factor of 30. The heat loss therefore increases by a factor 
of 30 — making it higher even than the heat loss through the dry T-shirt. 
When you hike in the hills and mountains, bring waterproof clothing! 

The table gives thermal conductivities for everyday substances (at room 
temperature). Our predictions for nonmetals match the data quite well. 
Having examined gases (in particular, air) and nonmetallic solids and liq- 
uids (water), let's turn to the remaining category of material. As the ta- 
ble shows, metals have an even higher thermal conductivity than a typical 
nonconducting solid. (For the unusually high thermal conductivity of dia- 
mond, try Problem 7.33.) 
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Metals, similarly, have a higher thermal diffusivity than 
most other substances. The reason, as we discussed in 
Section 7.3.3, is that, in a metal, heat is conducted not 
only by phonons but also by electrons. The electrons 
move much faster than the speed of sound, and the elec- 
tron mean free path is much greater than the phonon 
mean free path. 

Countering these increases, which increase the diffusiv- 
ity, only a small fraction of the free electrons participate 
in heat conduction. However, this effect is not enough to 
overcome the greater speed and mean free path. Thus, 
metals will have a higher thermal conductivity than non- 
metallic liquids or solids. As a rule of thumb, the typical 
K me tai ' s 200 watts per meter kelvin: a factor of 100 higher 
than that of nonmetallic solid. 

For this reason, a hot piece of metal, such as a seat-belt 
clip in a car outside on a hot day, feels much hotter than 
the plastic button on the same seat-belt clip, even though 
the plastic and metal are at the same temperature. The 
large heat flow from the metal into your finger pulls the 
surface temperature of your finger close to the tempera- 
ture of the hot metal. Ouch! 







diamond 


2000 


Cu 
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A1 


240 


cast iron 


55 


Hg 


8.3 
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2.2 


sandstone 


1.7 


glass 


1 


asphalt 


0.8 
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0.8 


concrete 


0.6 


water 


0.6 


soil (dry) 


0.5 


wood 


0.15 


He (gas) 


0.14 


methane (gas) 


0.03 


air 


0.02 



Problem 7.30 Stone versus wood floors 

Why, on a winter morning, do wood floors feel more comfortable than stone floors? 

Problem 7.31 Mercury is special 

Why does mercury (Hg) have such a low thermal conductivity for a metal? 



7.5 Summary and further problems 

In large, complex systems, the information is either overwhelming or not 
available. Then we have to reason with incomplete information. The tool 
for this purpose is probabilistic reasoning — in particular, Bayesian probabil- 
ity. Probabilistic reasoning helps us manage incomplete information. Us- 
ing it, we can estimate the uncertainty in our divide-and-conquer estimates 
and understand the physics of random walks and thereby viscosity, bound- 
ary layers, and heat flow. 
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Problem 7.32 Reynolds number as a ratio of two times 

For an object moving through a fluid, the Reynolds number is defined as vL/v, 
where v is the object's speed, L is its size (a length), and v is the fluid's kinematic 
viscosity. Show that the Reynolds number has the physical interpretation 

momentum-diffusion time over a distance comparable to the size L 

qq q q S-q q q: • (7.100) 

fluid-transport time over a distance comparable to the size L 

Problem 7.33 Diamond is special 

Diamond has a high thermal conductivity, much higher even than many metals. 
The speed of sound in diamond is 12 kilometers per second, and diamond's spe- 
cific heat Cp is 0.63 kilojoules per kilogram kelvin. Use those values to estimate 
the mean free path of phonons in diamond, as an absolute length and in units of 
typical interatomic spacings. How does the mean free path in diamond compare 
to a typical phonon mean free path of a few lattice spacings? 

Problem 7.34 Baking in three dimensions 

Extend the fish-cooking argument of Section 7.3.3.2 to three dimensions to predict 
the baking time of a 6-kilogram turkey (assumed to be a sphere). How well does 
the time agree with experience (for example, with the data given in Problem 7.22)? 

Problem 7.35 Resistive networks to analyze random walks 

Random walks are closely connected to infinite resistive networks (this connection 
is explored deeply in Random Walks and Electric Networks [11]). In particular, the 
probability of escape p esc — the probability that an n -dimensional random walker 
escapes to infinity and never returns to the origin — is related to the resistance R 
to infinity of a //-dimensional electrical network of unit resistors: p esc = 1/2 nR. 
Use this connection, along with lumping arguments, to estimate R and thereby 
show that the two-dimensional random walk is recurrent (p esc = 0) but that the 
three-dimensional walk is transient (n per > 0) — consistent with Polva's theorem 
(Problem 7.17). 

Problem 7.36 Turning differential equations into algebraic equations 

The cold days of winter arrive, and the ice on a lake cold air (< 0 °C) 

starts thickening as heat flows upward through the 
ice, turning ever more water into ice. Find the seal- ^c at ice 
ing exponent /3 in I 

ice thickness oc (time) f. (7.101) water (0 C) 

Problem 7.37 Thermal and electrical conductivities 

Among the metals, the better thermal conductors, such as copper and gold in com- 
parison to aluminum, iron, or mercury, are also the better electrical conductors. 
(This connection is quantified in the Wiedemann— Franz law.) What is the reason 
for this connection? 
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Problem 7.38 Teacup spindown 



You stir your afternoon tea to mix the milk (and sugar, if 
you have a sweet tooth). Once you remove the stirring 
spoon, the rotation begins to slow. In this problem you'll 
estimate the spindown time r: the time for the angular 
velocity of the tea to fall by a significant fraction. To esti- 
mate t, consider a lumped teacup: a cylinder with height l 
and diameter /, filled with liquid. Tea near the edge of the 
teacup — and near the base, but for simplicity neglect the 
effect of the base — is slowed by the presence of the edge (a 
result of the no-slip boundary condition). 



tea / 



/ 



a. In terms of the viscous torque T, the initial angular velocity ay and p and /, 
estimate the spindown time t. Hint: Consider angular momentum, and drop 
all dimensionless constants, such as n and 2. 

b. To estimate the viscous torque T , use the result of Problem 7.25: 

viscous force = pv x velocity gradient x surface area. ( 7 . 102 ) 

The velocity gradient is determined by the boundary-layer thickness 5 . In terms 
of 5, estimate the velocity gradient near the edge and then the torque T. 

c. Put your expression for T into your earlier estimate for r, which should now 
contain only one quantity that you have not yet estimated (the boundary-layer 
thickness 5). 

d. Estimate 5 in terms of a growth time t, which is the time to rotate 1 radian. 
After 1 radian, the fluid is moving in a significantly different direction, so the 
momentum fluxes from different regions no longer add constructively to the 
growth of the boundary layer. 

e. Put the preceding results together to estimate the spindown time t: symboli- 
cally in terms of v , p, l, and ay and then numerically. 

f. Stir your tea and estimate r experimentally, and compare with your prediction. 
Then enjoy a well-deserved cup. 
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A correct analysis works in all cases — including the simplest ones. This 
principle is the basis of our next tool for discarding complexity: the method 
of easy cases. We will meet the transferable ideas in an everyday example 
(Section 8.1.1). Then we will use them to simplify and understand complex 
phenomena, including black holes (Section 8. 2.2. 2), the temperature of the 
Sun (Section 8.3.2.3), and the diversity of water waves (Section 8.4.1). 



8.1 Warming up 

Let's start with an everyday example, so that we do not have to handle 
mathematical or physical complexity along with learning the new tool. 

8.1.1 Everyday example of easy cases 

One August, upon becoming eligible for one of the tax benefits at work, I 
chose to put $500 in an account for the calendar year's health costs (a feature 
of America's bureaucratic health system). The payroll office advised me 
that they would deduct $125 each month for the rest of the year — namely, 
for August through December. As August is the eighth month and Decem- 
ber the twelfth month, the amount looked correct: 
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$500 



$500 



$125 



12th month — 8th month 4 months month 



(8.1) 



However, in January, the payroll office advised me that they should have 
deducted only $100 per month. 

Which amount was correct? 

The simplest way to decide is the method of easy cases. Don't directly solve 
the hard problem of computing the correct deduction in general. Instead, 
imagine a simpler world in which I started deducting for the year in Decem- 
ber. In this easy case, with only one month providing the year's deduction, 
the answer requires no calculation: Deduct $500 per month. 

Then use the easy case and this result to check the proposed recipe, which 
predicted $125 per month when deductions started in August. In the easy 
case, when deductions start in December, the starting and ending months 
are both the twelfth month, so the recipe predicts an infinite deduction: 

$500 $500 $ infinity 



12th month — 12th month 0 months 



month 



( 8 . 2 ) 



They do not pay me enough to survive that recipe. It needs an adjustment: 
$500 $500 $500 



12th month — 12th month +1 month 1 month month 



( 8 . 3 ) 



Applying this modified recipe to an August instead of a December start, the 
denominator is 5 months rather than 4 months, and the deduction is $100 
per month. The revised advice from the payroll office was correct. 

This analysis contains several features that we can abstract away and use to 
simplify difficult problems. First, the easy cases are specified by values of a 
dimensionless quantity. Here, it is the difference m 2 — m | between the first 
and last month numbers. Second, for particular values of the dimensionless 
quantity — for the easy cases — the problem has an obvious answer. Here, 
the easy case is m 2 — nq = 0. Third, understanding the easy cases transfers 
to the hard cases. Here, our understanding of the case m 2 — in | = 0 shows 
that the denominator should be m 2 — «q + 1 rather than m 2 — wq . 



8.1.2 Easy cases of the shared-birthday probability 

The same easy-cases reasoning helps us check more abstruse formulas. As 
an example, cast your mind back to the birthday paradox of Section 4.4. 
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There, we used proportional reasoning to explain why we need only 23 
people, rather than 183 people, in a room before two people are likely to 
share a birthday. Checking the prediction required the exact probability of 
a shared birthday: 



The last part of Problem 4.23 asked why, with n people in the room, the last 
factor in this probability contains (n — l)/365 rather than n/ 365. 



The simplest answer comes from the easy case n = 1. With one person in 
the room, the probability of sharing a birthday is zero. In this easy case, the 
prediction of the candidate formula with n in the numerator is easy to test. 
Its last factor would be 1 — 1/365: 



^candidate with n 1 




(8.5) 



This probability is 1 /365, which is incorrect. Thus, with n people in the 
room, the last factor in the probability of a shared birthday cannot contain 
n /365. In contrast, the candidate formula with (n — 1) /365 in the last factor 
correctly predicts zero probability when n — 1: 



^candidate with n — 1 






( 8 . 6 ) 



This candidate must be correct. 



Problem 8.1 Sine or cosine by easy cases 

For a mass sliding down an inclined plane, is its acceleration along the plane g sin 9 
or g cos 9, where 6 is the angle the plane makes with horizontal? Use an easy case 
to decide. Assume (the easy case of!) zero friction. 

Problem 8.2 Friction by easy cases 

For the mass of Problem 8.1, relax the assumption of zero friction. If the coeffi- 
cient of sliding friction is /t, choose the block's correct acceleration along the plane: 
(a) g(sin0 + /t cos 9), (b) g( 1 + ]i) sin 9, (c) g(sin 9 — }i cos 9), or (d) g( 1 - //) sin 9. 



8.2 Two regimes 

After warming up with the examples in Section 8.1, let's now look system- 
atically at how to use the method of easy cases. The first step is to identify 
the dimensionless quantity. Once you know it — let's call it /3 — the behavior 
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of the system almost always divides into three regimes: /5 « 1, /5 ~ 1 (or 
[) = I ), and /I » 1 . In a common simplification, which is the subject of this 
section, one regime is impossible, so two regimes remain. (In Section 8.3, 
we will discuss what to do when all three regimes remain.) This simplifi- 
cation can happen because symmetry makes the bookend regimes /3 « 1 
and /3 » 1 equivalent (Section 8.2.1) or because a geometric or physical 
constraint excludes one regime (Section 8.2.2). 

8.2.1 Only two regimes because of symmetry 

The bookend regimes /3 « 1 and /i » 1 are often connected by a symmetry. 
Then the analysis at one bookend can be used as the analysis for the other 
bookend, and there are really two regimes: the symmetric bookends and 
the middle regime. 

8. 2. 1.1 Why multiplication is more important than addition 

As an example, here's an easy-cases explanation of why, when estimating, 
multiplication is a more important operation than addition. Let's say that 
we are estimating a cost, a force, or an energy consumption that splits into 
two pieces A and B to add together — for example, energy for heating and 
for transportation. Estimating A + B seems to require addition. 

The need disappears after we examine the easy-cases regimes. The regimes 
are determined by a dimensionless quantity. Because A and B have the 
same dimensions, their ratio A/B is dimensionless, and it categorizes the 
three easy-cases regimes. 

Regime 1 Regime 2 Regime 3 

A «B A ~ B A» B 
A/B « 1 A/B ~ 1 A/B » 1 

In the first regime, the sum A + B is approximately B . In the symmetric 
third regime, the sum is approximately A. The common feature of the first 
and third regimes — their invariant — is that one contribution dominates the 
other. Only the second regime, where A ~ B, is different. To handle it, just 
make the lumping approximation that A = B; then A + B « 2 A. 

So, when estimating A + B, we do not need addition. In the first and third 
regimes, we just pick the larger contribution, A or B . In the second regime, 
we just multiply by 2. 
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8. 2. 1.2 Area of an ellipse 



A mathematical example of this kind of symmetry happens when 
guessing the area of an ellipse. The dimensionless quantity is its 
aspect ratio n/b, where n is one-half of the horizontal diameter 
and b is one-half of the vertical diameter. Here are the regimes. 




Regime 1 




a/b « 1 
area — > 0 




a/b = 1 



area = na 



Regime 3 




a/b » 1 
area — > 0 



The first and third regimes are identical because of a symmetry: Interchang- 
ing a and b rotates the ellipse by 90° (and reflects it through its vertical axis) 
without changing its area. Therefore, any proposed formula for the area 
must work in the first regime, must satisfy the symmetry requirement that 
interchanging a and b has no effect on the area (thereby taking care of the 
third regime), and must work for a circle (the second regime). 

Because of the symmetry requirement, simple but asymmetric modifica- 
tions of the area of a circle — namely, na 2 and nb 2 — cannot be the area of 
an ellipse. A plausible, symmetric alternative is n{a 2 + b 2 ) / 2. In the sec- 
ond regime, where a = b, it correctly predicts the area of a circle. However, 
it fails in the first regime: The area isn't zero even when a = 0. 

Is there an alternative that passes all three tests? 

A successful alternative is nab. It predicts zero area when a = 0; it is sym- 
metric; and it works when a = b. It is also the correct area. 



8. 2. 1.3 Atwood machine 

In the preceding examples, we had a ready-made dimensionless group and 
used it to categorize the three easy-cases regimes. The next example shows 
what you do when there is no group handy: Use dimensional analysis to 
make one. Then use the group to categorize the regimes and understand 
the system's behavior. Easy cases and dimensional analysis work together. 
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The example is the Atwood machine, a staple of the introductory mechan- 
ics curriculum. Two masses, m 1 and m 2 , are connected by a massless, 
frictionless string and are free to move up and down because of the pul- 
ley. This machine, invented by the Reverend George Atwood, reduces 
the effective gravitational acceleration by a known fraction and makes 
constant-acceleration motion easier to study. (A historical discussion of 
the Atwood machine is given by Thomas Greenslade in [27].) Today its 
principle is used in elevators: The elevator car is m 2 , and the counter- 
weight is m 2 . 




Wlwt is the acceleration of the masses? 



Our second step is to identify the easy-cases regimes, 
but our first step is to find the dimensionless group 
that categorizes them. Therefore, we make our list of 
relevant quantities and their dimensions. The goal 
is the acceleration of the masses. Because they are 
connected by a string, their accelerations have the same magnitude but op- 
posite directions. Let a be the downward acceleration of the mass on the 
left — whether it is labeled m 2 or m 2 (planning for an application of symme- 
try). Because a is our goal quantity, it is the first quantity on the list. The 
motion is caused by gravity, so the list includes g. Finally, the masses affect 
the acceleration, so the list includes and m 2 . And that's the whole list. 

Doesn't the tension also affect the acceleration? 



a 


LT -2 


acceleration 


g 


LT -2 


gravity 


m 1 


M 


left block's mass 


m 2 


M 


right block's mass 



The tension has an important effect: Without the tension, there would be 
no problem to solve, because each mass would accelerate downward at g. 
However, the tension is a consequence of m lr m 2 , and g — quantities already 
on the list. Thus, tension is redundant; adding it would only confuse the 
dimensional analysis. 

This list contains only two independent dimensions: mass (M) and acceler- 
ation (LT -2 ). Four quantities built from two independent dimensions pro- 
duce two independent dimensionless groups. The most natural choices are 
the acceleration ratio a/g and the mass ratio m 2 /m 2 . Then the most general 
dimensionless statement is 




where / is a dimensionless function. 
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Although the next step is to use easy cases to guess the dimensionless func- 
tion, first pause for a moment. The more we think now in order to choose 
a suitable representation, the less algebra we have to do later. Here, the 
ratio nii/m 2 does not respect the symmetry of the problem. Interchanging 
the masses and m 2 turns m 1 jm 2 into its reciprocal: It raises the ratio to 
the —1 power. Meanwhile, because the left mass is now m 2 , which has the 
opposite acceleration to mi, this symmetry operation changes the sign of 
the acceleration a (which measures the downward acceleration of the left 
mass): It multiplies the acceleration by —1. The function f will have to in- 
corporate this change in the nature of the symmetry and will have to work 
hard to turn m 2 /m 2 into an acceleration. Therefore, / will be complicated 
and hard to guess. (You can find it anyway in Problem 8.3.) 



A more symmetric alternative to the ratio m 2 /m 2 is the difference m 2 — m 2 . 
Now interchanging m 2 and rn 2 negates — m 2 , just as it negates the accel- 
eration. Unfortunately, m 2 — m 2 is not dimensionless! It has dimensions of 
mass. To make it dimensionless, we need to divide it by another mass, say 
nil ' t° make (r« | —m 2 ) /ni \ . Unfortunately, that choice abandons our beloved 
symmetry. Dividing instead by the symmetric combination m 2 + m 2 solves 
all the problems. Let's call the result x: 



nil — m 2 
x - — £ . 

nil + m 2 



( 8 . 8 ) 



This ratio is dimensionless and respects the problem's symmetry. With it 
as the dimensionless group, the most general dimensionless statement is 



a =h ( mi-m 2 \ 

g V nil + m 2 ) ' 



( 8 . 9 ) 



where h is a dimensionless function (different from /). (This combination of 
nil ar >d m 2 doesn't satisfy the definition of a group given in Section 5.1.1, as a 
dimensionless product of the quantities raised to various powers. However, 
it is worth extending the concept to include such combinations.) 

To guess h, study the easy-cases regimes according to x. They are three: 
x — — 1 (imagine that m \ = 0), x = 0 (when r« | = m 2 ), and x — +1 (imagine 
that m 2 = 0). Here you see how easy cases help us. The extremity of the 
first and third regimes and the simplicity of the second regime amplify our 
physical intuition and help our gut predict the behavior. 

In the first regime, m 2 falls as if there were no mass on the other end of 
the string. Thus, m 2 rises with acceleration +g. Because a measures the 
downward acceleration of the left mass, a = —g. In the third, symmetric 
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Regime 1 



Regime 2 



Regime 3 




I 



I 



8 

x = -1 
m 1 = 0 
n = -g 



x = 0 
= m 2 
(7 = 0 



8 

x = +1 
m 2 = 0 
a = +g 



regime, wq accelerates downward as if there were no mass on the other end, 
so a = +g. In the second regime, where x = 0 or m , = m 2r the masses are 
in equilibrium, and a = 0. 



Here are the three easy cases plotted on a graph 
of the dimensionless acceleration a/g versus the 
dimensionless mass parameter x. Based on this 
graph, the simplest conjecture, an educated guess, 
is that the full graph of h(x) is the straight line of 
unit slope passing through the three points. Thus, 
its equation is a/g = x. In terms of the masses, the 
equation is 



a/g 

A m 2 = 0 

1 -» 



m\ = m 2 



-1 






a _ m\ ~ m 2 
g m 1 + m 2 ' 



( 8 . 10 ) 




-l 



mi = 0 

In Problem 8.4(d), you solve for the acceleration by 

using Newton's laws to find the tension in the string. Then you'll confirm 
our reasonable conjecture based on easy-cases reasoning. 



Problem 8.3 Using the less symmetric dimensionless group 

Find the dimensionless function / based on the simpler, but less symmetric, di- 
mensionless group nii / m 2 . 




Compare it to the dimensionless function that results from choosing the symmetric 
form (mi ~ + m i) as the independent variable. 
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Problem 8.4 Tension in the string in Atwood's machine 

In this problem you use easy cases to guess the tension in the connecting string. 

a. The tension T, like the acceleration, depends on m lr m 2 , and g. Explain why 
these four quantities produce two independent dimensionless groups. 

b. Choose two suitable independent dimensionless groups so that you can write 
an equation for the tension in this form: 

group proportional to T = /(group without T). (8.12) 

c. Use easy cases to guess /; then sketch /. 

d. Solve for T using freebody diagrams and Newton's laws. Then compare that re- 
sult with your guess in part (c), and use T to find a, the downward acceleration 
of ttli- 

8.2.2 Only two regimes because of a restriction 

In the preceding examples, there were three regimes of a dimensionless 
quantity, but the two bookend regimes were connected by a symmetry op- 
eration and were therefore the same situation. Therefore, there were only 
two qualitatively different regimes. In the next group of examples, there are 
only two regimes for a different reason: A geometric or physical constraint 
excludes one bookend regime. 



8.2.2. 1 Projectile range 



In Section 4.2.3, we used proportional reasoning 
to show that the range of a projectile should have 
the form R ~ v 2 /g, where v is its launch velocity. 
However, we didn't determine how the launch 
angle 6 affects the range. We can do so using easy 
cases, with help from dimensional analysis. 




Now the problem contains four quantities: R, v, g, and R L 
6. Four quantities containing two independent dimen- LT -i 
sions (for example, L and T) produce two independent LT -2 
dimensionless groups. A reasonable pair is the launch a j 
angle 6 — which is already dimensionless — and, based 



range 

launch velocity 
gravity 
launch angle 



on the proportional-reasoning result, Rg/v 2 . Then the most general dimen- 



sionless statement is 




( 8 . 13 ) 
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where / is a dimensionless function. Solving for R, 
v 2 

R = — x f (6). 

8 



( 8 . 14 ) 



Dimensional analysis does not tell us the form of f , but we can guess it 
by examining the easy cases. Easy-cases reasoning is a way of introducing 
physical knowledge. 

Because angles are dimensionless, 6 can categorize the easy-cases regimes. 
The natural easy cases of any quantity are its extremes: 6 « 1 (our usual 
first regime) and 6 » 1 (our usual third regime). The regime 0 « 1 in- 
cludes the smallest launch angle 0 = 0°. Then the range is zero: The pro- 
jectile starts at ground level, is launched horizontally, and hits the ground 
immediately. However, the opposite extreme 6 » 1 is not useful, because 
angles are periodic. Any angle beyond 2n is already handled by an angle 
less than 2n. Our usual third regime is therefore geometrically excluded. 

Our usual second regime is 6 ~ 1. Here, the easy angle comparable to 1 
(in radians) is 6 = 90° (n/2 radians). This angle describes a vertical launch, 
which doesn't produce any range. Thus, R in this regime is also 0. 

The two regimes are therefore 0 = 0 (an example of 6 « 1) and 0 = tt/ 2 (an 
example of 8 ~ 1). 



Regime 1 Regime 2 

0 = 0 0 = n/ 2 

R = 0 R = 0 
f(0) = 0 f(0)= 0 



What reasonable functions satisfy the two easy-cases constraints on f {6)7 



One such function is the product of two straight lines, 
each line ensuring that one of the constraints is satisfied: 



m = e(^-e). 



( 8 . 15 ) 




However, this functional form looks funny. The n/2 in 
the second factor appears by magic, with no obvious ori- 
gin in the equations of free fall. Worse, the guess is not periodic. Increasing 
6 by 2tt shouldn't change the range, but it changes this proposed /. Thus, 
we need to keep looking. 



M3 
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A similar but less magical form, also the product of two 
factors, is 

f(6 ) ~ sin 6 cos 6. (8.16) 

The cos 8 factor ensures that f(n/2) = 0. The sin 0 fac- 
tor ensures that /( 0) = 0. This functional form is also 
periodic. And it has a physical justification: sin 6 is the 
trigonometric factor that converts the launch velocity v into its vertical com- 
ponent Vy ; the cos 6 factor does the same for the horizontal component v x . 
Because these velocities appeared in the proportional-reasoning analysis 
of Section 4.2.3, we already know that they are physically relevant, making 
this functional form more plausible than the preceding form. 

The resulting range, including the effect of the angle, is then 

ip- 

R sin 9 cos 9. (8.17) 

8 

This form is correct, and the dimensionless prefactor turns out to be 2 (as 
you can show in Problem 8.5). 

Problem 8.5 Finding the dimensionless constant 

Extend the proportional-reasoning analysis of Section 4.2.3 to show that the miss- 
ing dimensionless prefactor in the projectile range is 2. 




8. 2. 2. 2 Light bending with large angles 

Our next example of a physical limit excluding a third regime also involves 
an angle: the bending of light by gravity. 

apparent 




In Section 6.4.6, we used lumping to find that the gravitational field from a 
mass m bends light by an angle 

Gm 



Ms 





290 



8 Easy cases 



where r is the distance of closest approach. 

This angle is dimensionless. Thus, let's use it to categorize and to investi- 
gate the easy cases of light bending. The first regime is 8 « 1 — for example, 
the Sun bending starlight by roughly 1 arcsecond. The second regime is 

8 ~ 1. In this regime, a new physical phenomenon appears, and lumping 
will help us analyze it. 

The lumping analysis is based on the American televi- 
sion cartoon of the Road Runner pursued by the Wile 
E. Coyote. The road runner runs across a chasm, fol- 
lowed by the coyote. They cruise as if gravity is gone, 
until the road runner, safely across the gap, holds up 
a sign with an arrow pointing downward explaining, 

"Gravity this way." The coyote looks down, remembers 
the existence of gravity, and falls into the chasm. 

In the coyote model, light zooms past the mass 
(say, a star) ignorant of gravity. After it has gone 
too far — by a distance comparable to r — the star 
holds up a sign saying, "You forgot about grav- 
ity! Please bend by 8 now!" In this second regime, 

9 ~ 1. The following pictures and analysis are 
clearest if 8 = n/2 (or 90°), so let's use 90° to rep- 
resent the 9 ~ 1 regime. 

The ray, on command, bends by 90°. It travels 
along and gets reminded again. The distance of closest approach is still 
r, so 8 is still 90°. The beam traces out a square. Although the ray doesn't 
actually follow this path with its sharp corners, the path illustrates the fun- 
damental feature of the 9 ~ 1 regime: The ray is in orbit around the star. 
When 9 ~ 1, light gets captured by the strong gravitational field. The third 
regime, 8 » 1, doesn't introduce a new physical phenomenon — the light is 
still captured by the gravitational field. In either regime, the mass is a black 
hole. 

Problem 8.6 Guessing a variance 

The variance is a squared measure of the spread in a distribution: 

Varr = (x 2 ) — < x ) 2 . (8.19) 

where (x 2 ) is the average or expected value of x 2 (the mean square) and (x) is the 
average value of x (the mean). 
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a. Use an easy case to convince yourself that the mean square < x 2 } is never smaller 
than the squared mean (xj 1 . 

b. The simplest possible distribution has only two possibilities, x = 0 and x = 1, 
with probabilities 1 — p and p, respectively. This distribution describes a coin 
toss where heads (represented by x = 1) comes up with probability p. Use easy 
cases to guess Varx. 

Problem 8.7 Local black hole 

What is roughly the largest radius that the Earth could have, with its current mass, 
and be a black hole? 



8.3 Three regimes 

Two regimes are easier than three. Therefore, we studied that case first to 
develop experience and identify the transferable ideas. Fortunately, even 
when a complex situation has three regimes, two simplifications are pos- 
sible. First, the bookend regimes are often easier than the middle regime 
(Section 8.3.1). Then we study the bookends and, to predict the behavior 
in the middle, interpolate between the bookends. Alternatively, two effects 
compete and reach a draw in the middle regime (Section 8.3.2). This middle 
regime is then the regime found in nature. 

8.3.1 Three regimes where the bookends are easier 

In a common easy-cases situation, the dimensionless number categorizing 
the three regimes is a ratio between two physical effects. Then the two book- 
ends are the easiest regimes to analyze — because at each bookend, one or 
the other effect vanishes. We'll practice this analysis in an example from 
introductory mechanics (Section 8. 3. 1.1); then we'll graduate to drag (Sec- 
tion 8.3. 1.2). 



8. 3. 1.1 Rolling down the plane 



As our introductory example, let's revisit objects rolling with- 
out slipping down an inclined plane. The goal will be to pre- 
dict an object's acceleration n along the plane. Dimensional 
analysis, from Problem 5.18, tells us that the most general di- 
mensionless statement about n is 



~=f 

8 






mr z 



mass m 




( 8 . 20 ) 
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where / is a dimensionless function, 8 is the incline angle, I is the object's 
moment of inertia, m is its mass, and r is its radius. The only quantities 
that affect the acceleration are the two dimensionless groups: the incline an- 
gle 8 and the dimensionless mass distribution I/mr 2 . The ratio I/mr 2 , and 
therefore the acceleration, is invariant under changes to the object's mass 
or radius (for example, making a bigger ring or disc). In plainer language, 
which however doesn't connect as much to symmetry reasoning, simply 
changing the object's mass or radius without changing its shape does not 
change its acceleration. 

Which shape rolls faster — a ring or a disc? 



In this comparison, the inclined plane and thus the incline angle 8 remain 
fixed. However, the dimensionless group I / mr 2 changes and can categorize 
the easy-cases regimes. This group occurs repeatedly in the analysis, so 
we'll often symbolize it as f>. By understanding the behavior in the regimes 
defined by f, we'll be able to predict the result of the ring-disc race. 

Let's start with the first regime, I /mr 2 = 0. This regime is reached by mak- 
ing I zero. Then rolling, represented by I, becomes irrelevant. The object 
moves as if it were sliding down the plane without friction. Its acceleration 
is then g sin 8, and a/g = sin 8. 



With this easy-cases result, we can simplify the unknown function /, which 
unfortunately is a function of two dimensionless groups. As a reasonable 
conjecture, the dependence on angle is probably still sin 8, even when I is 
not zero. Then the dimensionless statement simplifies to 





( 8 . 21 ) 



where h is a dimensionless function of only one group. 

Let's guess h using easy cases of f> = I/mr 2 . The three regimes will be f> = 0 
(a special case of f> « 1), /3 — 1, and f » 1 . In the first regime, when f> = 0, 
rolling is unimportant, as we just observed. Therefore, a/g = sin 8 and the 
dimensionless function h(f) is just 1. 

The middle regime ft ~ 1 is difficult to analyze without a full calculation. 
Yet the goal of an easy-cases analysis is to make the situation simple enough 
that the behavior is easy to see and we can skip the full calculation. Even 
the simplest value within this regime, f> = 1, is not any easier than other 
values. (It describes the ring, where all mass is distributed at the edge of the 
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object, at a distance r from the center). The middle regime is, as promised, 
the difficult regime. Neither fish nor fowl, it mixes rolling and sliding in 
comparable amounts. 

The solution is to study the third regime, where /3 » 1, and then to inter- 
polate between the first and third regimes in order to predict the behavior 
in the middle regime. (We implicitly used this interpolation approach in 
Section 6.4.3, when we estimated the time and distance for a falling cone to 
reach its terminal speed.) 

For an object rolling down the plane, how can fi be increased beyond 1 ? 



There seems to be no way to increase fi beyond 1: When fi = 1, all mass is 
already distributed at the edge. Fortunately, the subtle point is that, in the 
dimensional analysis, r is the rolling radius, rather than the radius of the 
object, because the rolling radius determines the torque and the motion. 
The object's radius can be larger than the rolling radius. This possibility 
allows us to increase fi as much as we want. 

As an example, imagine a barbell used for weight lifting: A 
thin axle of radius r connects two large and massive ears of 
radii R. Place the axle on the inclined plane, with the ears 
beyond the plane. The rolling radius is the radius r of the axle. 

However, the moment of inertia I is mostly due to the large ears, because 

moment of inertia ex (distance from the axis of rotation) . (8.22) 

Thus, I ~ mR 2 , where m is the combined mass of the two ears, which is 
almost the same as the mass of the whole object. By choosing R » r, we 
can make /i » 1: 





( 8 . 23 ) 



How quickly does this large- fi object accelerate down the plane? 



This extreme regime is easier to think about than the intermediate regime 
fi = 1. The extremity of the fi » 1 regime amplifies our intuition and shouts 
loudly enough for our gut to hear. Then our gut's answer is loud enough 
for us to hear: The barbell, rolling on its small axle, as long as it doesn't slip, 
will creep down the plane (the italics represent my gut's shouting). Going 
fast would, in spinning up the ears, demand far more energy than gravity 
can provide. Thus, as the ears get ever larger and fi goes to infinity, the 
dimensionless rolling factor h(fi) goes to zero. 
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A simple guess that accounts for the two extreme 
regimes is 



HP) = 



l 

1+/T 



( 8 . 24 ) 



sliding limit: h( 0) = 1 

\ 

\ 

N x interpolated 
s ^ guess 



In terms of the original quantities, the accelera- 
tion would be 

gsinff 0 



big-ears limit 



1 + I /mr 2 



( 8 . 25 ) 



This educated guess turns out to be correct (Problem 8.8). Now we can 
answer our original question about which shape rolls faster. The farther 
outward the mass is distributed, the greater /3 is, so /i ring > /3 disc . Because a 
decreases as ft increases, the disc rolls faster than the ring. 



Problem 8.8 Exact solution to rolling down the plane 

Use conservation of energy to find the acceleration of the object along the plane, 
and confirm the guess based on the easy-cases regimes. 

Problem 8.9 A pendulum with chewing gum 

A pendulum with a disc as the pendulum bob is oscillating with period T. You 
then stick a small piece of chewing gum onto the center of the disc. How does the 
gum affect the pendulum's period: an increase, no effect, or a decrease? 




after 



before 





8.3 Three regimes 



295 



8. 3. 1.2 Drag using easy cases 

Drag, like any phenomenon related to fluids, is a hard problem. In par- 
ticular, there is no way to calculate the drag coefficient c d as a function of 
Reynolds number Re — even for simple shapes such as a sphere or a cylinder. 
However, dimensional analysis (Section 5.3.2) has told us that 

drag coefficient = /(Reynolds number). (8.26) 

Ci Re 

However, dimensional analysis could not tell us the function /. That's no 
slur on dimensional analysis; finding the whole function is beyond our 
present-day understanding of mathematics. However, we can understand 
much about / in two easy cases: the bookend regimes Re » 1 and Re « 1. 
In the difficult middle regime Re ~ 1, the function f interpolates between 
its behavior in the two extremes. 

Lozv Reynolds number. Flows at low Reynolds number, although not as fre- 
quent in everyday experience as flows at high Reynolds number, include 
a fog droplet falling in air (Problem 8.13), a bacterium swimming in water 
(discussed in the classic paper "Life at low Reynolds number" [39]), and 
ions conducting electricity in seawater (Problem 8.10). Our goal here is to 
find the drag coefficient in the regime Re « 1. 

The Reynolds number (based on radius) is vr/v, where v is the speed, r is 
the object's radius, and v is the kinematic viscosity of the fluid. Therefore, 
to shrink Re, either make the object small, make the object's speed low, or 
use a fluid with high viscosity. The method does not matter, as long as Re 
is small: The drag coefficient is determined not by any of the individual 
parameters r, v, or v , but rather by the abstraction Re. We'll choose the 
means that leads to the most transparent physical reasoning: making the 
viscosity huge. Imagine, for example, a tiny bead oozing through a jar of 
cold honey. (The tininess of the bead further reduces Re.) 

In this extremely viscous flow, the drag force, as you might expect, is pro- 
duced directly by viscous forces. As you reasoned in Problem 7.25, viscous 
forces, which are proportional to the kinematic viscosity v , are given by 

^viscous ~ P v x velocity gradient x area. (8.27) 

The area is the surface area of the object. The velocity gradient is the rate at 
which velocity changes with distance; it is analogous to the concentration 
gradient An/ Ax in the Fick's-law analysis of Section 7.4.2 or to the tempera- 
ture gradient AT /Ax in the heat-flow analysis of Section 7.4.4. 
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Because the drag force is due to viscous forces directly, it should be pro- 
portional to v . This constraint determines the form of the function / and 
therefore the drag force. Let's write out the dimensional-analysis result 
connecting the drag coefficient and the Reynolds number: 



c d = 



F d 

2PF^cs^ 




(8.28) 



where F d is the drag force and A cs is the object's cross-sectional area. In 
terms of the radius, A cs ~ r 2 , so 




' c d Re 



The viscosity v appears only in the Reynolds number, where it appears in 
the denominator. To make F d proportional to v requires making the drag 
coefficient proportional to 1/ Re. Equivalently, the function/, when Re « 1, 
is given by f(x) ~ l/x. Then 



F d 

pr 2 v 2 



v_ 

vr' 



(8.30) 



For the drag force itself, the consequence is (as you found in Problem 7.26) 

9 9 1 / 

F d ~ pr z u z — = pvvr. (8.31) 



This result is valid for almost all shapes; only the dimensionless prefactor 
changes. For a sphere, the British mathematician George Stokes showed 
that the missing dimensionless prefactor is 6tt: 

F d = 6npvvr. (8.32) 



Accordingly, this result is called Stokes drag. 

High Reynolds number. Because the Reynolds number rv/v contains three 
quantities, the limit of high Reynolds number can also be reached in three 
ways, all of which produce the same behavior. We'll choose the method 
that is easiest to feel, which is to shrink the viscosity v to 0. In this limit, 
called form drag, viscosity disappears from the problem and the drag force 
should not depend on viscosity. This reasoning contains several untruths, 
but they are subtle and the conclusion is mostly correct. (Clarifying the 
subtleties required centuries of progress in mathematics, culminating in 
the theory of singular perturbations and boundary layers, the subject of 
Section 7.3.4 and discussed much more in Physical Fluid Dynamics [46].) 
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Without viscosity and thus without the Reynolds number to depend 
on, the dimensionless function f must be a constant! Its value de- 
pends on the shape of the object and typically ranges from 0.5 to 2. 
Because the drag coefficient is F d / ^pv 2 A cs , the drag force is 

F d ~ pv 2 A cs . (8.33) 



car 0.4 

sphere 0.5 

cylinder 1.0 

flat plate 2.0 



This result agrees with our analysis in Section 3.5.1 based on energy 
conservation. In that analysis, we had implicitly assumed a high Reynolds 
number without knowing it. 



Problem 8.10 Conductivity of seawater 

Estimate the conductivity of seawater by assuming that the current-carrying ions 
(Na + or Cl - ) travel with a shell one layer thick of water molecules around them. 

Problem 8.11 Drag coefficient resulting from Stokes drag 

At low Reynolds number, the drag force on a sphere is F — 6npvvr (Stokes drag). 
Find the drag coefficient c d as a function of Reynolds number, basing the Reynolds 
number on the sphere's diameter rather than its radius. 

Problem 8.12 Choosing the high- or low-Reynolds-number regime 

To estimate the terminal speed of a raindrop (Problem 3.37), you implicitly used 
the high-Reynolds-number regime. Now that you know another regime, how can 
you choose which regime to use? Without knowing which regime, you cannot 
find the terminal speed. Without the terminal speed, how do you find Reynolds 
number, which you need to choose the regime? The answer is to choose a regime 
and see whether the result is self-consistent. If it is not, choose the other regime. 

As practice with this reasoning, assume that the Reynolds number for the falling 
raindrop is small and therefore that the drag force F d is given by Stokes drag: 

F d ~ PairWr. ( 8 . 34 ) 

Estimate the resulting terminal speed v for the raindrop (diameter of about 0.5 
centimeters). Then check the assumption that Re « 1. 

Problem 8.13 Terminal speed of fog droplets 

a. Estimate the terminal speed of fog droplets (r ~ 10 ;m). In estimating the drag 
force, use either the limit of low or high Reynolds numbers — whichever limit 
you guess is more likely to be valid. (Problem 8.12 introduces this reasoning.) 

b. Use the speed to estimate the Reynolds number and check whether you used 
the correct limit for the drag force. If not, try the other limit! 

c. Fog is a low-lying cloud. How long does a fog droplet require to fall 1 kilometer 
(a typical cloud height)? What is the everyday effect of this settling time? 





298 



8 Easy cases 



Interpolation. We now know the drag force in the two extreme regimes: vis- 
cous drag (low Reynolds number) and form drag (high Reynolds number). 
Interpolating between these regimes is easiest in dimensionless form — that 
is, in terms of the drag coefficient rather than the drag force. 

When Re » 1, the drag coefficient c d is roughly 
0.5 (for a sphere). On log-log axes, the Re » 1 
behavior is a straight line with zero slope. In the 
Re « 1 regime, you should have found in Prob- 
lem 8.11 that the drag coefficient (for a sphere) 
is 24/ Re, where the Reynolds number is based 
on the diameter rather than the radius: The 1 / Re 
scaling happens because, as we just deduced, /(Re) ~ 1/Re to make the 
drag force proportional to viscosity v ; the dimensionless prefactor of 24 is 
what you had to compute. On log-log axes, the Re « 1 behavior is also a 
straight line but with a —1 slope. 

The dashed line interpolates between the extremes. The final twist is that 
at Re ~ 3 x 10 5 , the boundary layer becomes turbulent (Problem 7.23), and 
the drag coefficient drops significantly. With that caveat, we have explained 
the main features of drag, a ubiquitous force in the living world. 

Problem 8.14 Q as an easy-cases parameter 

A damped spring-mass system has a dimensionless measure of damping known 
as the quality factor Q, which you studied in Problem 5.53. Choose Q « 0.5, 

Q = 0.5, or Q » 0.5 to describe each of the three regimes: (a) underdamped, 

(b) critically damped, and (c) overdamped. 

Problem 8.15 Floating on water 

Some insects can float on water thanks to the surface tension of water. Extend the 
analysis of this effect from Problem 5.52 by estimating the dimensionless ratio 

_ force due to surface tension 
weight of the bug 

as a function of the bug's size / (as a length). Interpret the regimes R « 1 and 
R » 1, and find the critical bug size l 0 for floating on water. 

Problem 8.16 Energy loss in highway versus city driving 

Explain why the following dimensionless ratio measures the importance of drag: 
mass of fluid swept out 

r u- \ • (8-36) 

mass of object 

An equivalent computation, except for a dimensionless factor of order unity, is 
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fluid density distance traveled 

z — X — , (8.37) 

object density size ot object 

where the size of the object is its linear dimension in the direction of travel. When 
this ratio is comparable to unity, drag significantly affects the trajectory. 

Apply either form of the ratio to a car during city driving, and find the distance 
d at which the ratio becomes significant (say, roughly 1). How does the distance 
compare with the distance between stop signs or traffic lights on city streets? What 
therefore is the main mechanism of energy loss in city driving? How does this 
analysis change for highway driving? 



Problem 8.17 Gain of an RC circuit 

The low-pass RC circuit of Section 2.4.4 is amen- 
able to easy-cases analysis. With the abstraction 
of capacitive impedance (Section 2.4.4), explain 
the unity gain at low frequency and why the gain 
goes to zero at high frequency. What is the dimen- 
sionless way to say "high frequency"? 



Pout 



ground 



Problem 8.18 Bode magnitude sketch for an RC circuit 

A Bode magnitude plot is a log-log plot of |gain| versus frequency. In a lumped 
Bode sketch, which often gives the most insight into the behavior of a system, the 
segments of the plot are straight lines. Make the Bode magnitude sketch for the 
low-pass RC circuit of Problem 8.17, labeling the slopes and intersection points. 



8.3.2 Three regimes where two effects compete 

In the final group of three-regime examples, the three regimes are again 
based on the relative size of two physical effects. However, in contrast to the 
examples in Section 8.3.1, where we chose a regime — for example, by choos- 
ing a flow speed and thus the Reynolds number — here nature chooses. Na- 
ture chooses the middle regime, where the two competing physical effects 
reach a draw. The method of easy cases shows how that choice is made. 

8.3.2. 1 Height of the atmosphere 

Competition between physical effects is the theme of the wonderful Gases, 
Liquids and Solids and Other States of Matter [43, pp. xiii], where we learn 
how "the three primary states of matter are the result of competition be- 
tween thermal energy and intermolecular forces." The following example, 
estimating the height of our atmosphere, illustrates an analogous competi- 
tion: between thermal energy and gravity. Here is how these two physical 
effects determine the height of the atmosphere. 
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1. Thermal energy. Thermal energy gives the molecules speed. Unfettered 
by gravity, the air molecules and the atmosphere would disperse into 
space. Thermal energy is therefore responsible for increasing the atmos- 
phere's height. 

2. Gravity. Gravity pulls air molecules downward. If gravity is the only 
effect, meaning that thermal motion is absent (the atmosphere is at ab- 
solute zero), then gravity pulls the molecules all the way to the ground. 
Gravity therefore decreases the atmosphere's height. 



A dimensionless measure of the relative size of the two effects is a compari- 
son of the typical gravitational energy with the thermal energy of one mol- 
ecule. The typical thermal energy is k B T. With m for the molecule's mass 
and H for the characteristic or typical or scale height of the atmosphere, the 
typical gravitational potential energy is mgH. The energy ratio is then 



mgH 



(8.38) 



This ratio categorizes the three easy-cases regimes: /3 « l,/3 ~ 1, and [5 » 1. 



Regime 1 Regime 2 

thermal 

i 

atmosphere 

expanding stable 

jS « 1 jS ~ 1 

mgH « k B T mgH ~ k B T 



atmosphere 



Regime 3 

t 

gravity 

atmosphere 

contracting 

j8 » 1 

mgH » k B T 



In the first regime, the atmosphere is expanding: The molecules have so 
much thermal energy that they escape farther from Earth. In the third 
regime, the atmosphere is contracting: The molecules do not have enough 
thermal energy to resist gravity as it pulls them back to Earth. The happy 
medium, where the atmosphere is stable, is the middle regime. 

Therefore, the regime is chosen not by us but by nature. The height of the 
atmosphere is determined by the requirement that the two effects compete 
and reach a draw — when the two effects are comparable in strength. In that 
regime, mgH ~ k B T , so the atmosphere's scale height is 
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rr k B T 

H — . (8.39) 

nig 

For the Earth's atmosphere, where T « 300 K and the molecular mass m is 
approximately the mass of a nitrogen molecule, this height is roughly 8 kilo- 
meters — as we predicted in Section 5.4.1 using dimensional analysis. The 
easy-cases reasoning complements the dimensional analysis by providing 
a physical model. 

As a bonus, this general model of competition explains why we guess that 
an unknown dimensionless number is comparable to 1 — for example, when 
we estimated an atomic blast energy in Section 5.2.2. Often, the dimension- 
less number represents a ratio between two physical effects. Then "com- 
parable to 1" means "the effects have reached a draw." By guessing that 
unknown dimensionless numbers were comparable to 1, we foreshadowed 
the easy-cases reasoning that we use for these examples of competition. 

8. 3. 2. 2 Hydrogen’s binding energy 

Our next example, before we turn to the giant scales of stars, is our old 
friend the hydrogen atom. In Section 5.5.1, we predicted its size, the Bohr 
radius a 0 , using dimensional analysis. The prediction was correct, but, as 
with the dimensional analysis of the atmosphere height, dimensional analy- 
sis didn't give us a physical model. Easy-cases reasoning will help us make 
our tacit physical knowledge about hydrogen explicit and thereby build a 
physical model. 

As we found in Section 6.5.2 using lumping, two physical effects compete: 

1. Electrostatics. The electrostatic attraction between the proton and elec- 
tron shrinks the atom. 

2. Quantum mechanics. Via the uncertainty principle, quantum mechanics 
gives an electron confined to a small region a high momentum uncer- 
tainty and, therefore, a high kinetic energy. With this confinement en- 
ergy, analogous to the thermal energy of an air molecule, the electron 
can resist the inward pull of electrostatics. Therefore, quantum mechan- 
ics expands the atom. 

A dimensionless ratio that measures the relative size of the two effects is 
the energy ratio 

I electrostatic potential energy I 

P = T . — r — / (8-40) 

confinement energy 
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where the absolute-value bars say that, although the electrostatic potential 
energy is negative, we care only about its magnitude. 



In a hydrogen atom of radius r, the electrostatic energy is e 2 /Ane 0 r. The 
confinement energy — which we estimated in Section 6.5.2 by using lump- 
ing — is comparable to h z /m e r 2 . Therefore, 



e 2 /47re 0 r 
h 2 /m r 2 



(8.41) 



Regime 1 



P « 1 

expanding 

e 2 ft 2 

47r e 0 r m e r 2 



Regime 2 




/ 3~1 

stable 

e 2 h 2 
4 ne 0 r m e r 2 



Regime 3 




P » 1 

contracting 

e 2 h 2 

47T e 0 r m e r 2 



In the first regime, p « 1, quantum mechanics is much stronger than elec- 
trostatics, and the huge kinetic energy from quantum mechanics (the con- 
finement energy) expands the atom. In the third regime, p » 1, electrostat- 
ics, now much stronger than quantum mechanics, contracts the atom. 

The radius of hydrogen, like the height of the atmosphere, is determined by 
the requirement that two physical effects compete and reach a draw. This 
truce happens in the middle regime. There, e 2 /AneQr ~ h 2 /m e r 2 , so 



m e (e 2 /47re 0 ) 

which is the Bohr radius a 0 that we found using dimensional analysis (Sec- 
tion 5.5.1) and lumping (Section 6.5.2). 

An alternative approach, which also uses easy cases but gives different in- 
sights, is to base the dimensionless ratio on the size r directly. Making r 
dimensionless requires another size. Fortunately, we have one, because 
the two energies, electrostatic and confinement, have different scaling ex- 
ponents. The electrostatic energy is proportional to r . The confinement 
energy is proportional to r~ 2 . 
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Regime 1 Regime 2 Regime 3 




P « 1 P ~ 1 P » 1 

expanding stable contracting 

r « r 0 r ~r 0 r» r 0 



On log-log axes, they are straight lines but with different slopes: 

—2 for the confinement line and —1 for the electrostatic-energy 
line. Therefore, they intersect. The size at which they intersect 
is a special size. Let's call it r 0 . Then our dimensionless ratio is 
p = r/r 0 , and the three regimes are shown in the table. 

The stable middle regime is the one chosen by nature. Then r, 
the radius of hydrogen, is comparable to our special radius r 0 . 

The special radius, and therefore r, is determined by equating 
electrostatic and confinement energies, which is how we found the Bohr ra- 
dius a () in Section 6.5.2. Thus, the radius of hydrogen is the Bohr radius and 
results from competition between electrostatics and quantum mechanics. 

Our next project is to use the easy-cases regimes of size to 
understand thermal expansion — why substances expand 
upon heating. The first step is to sketch the total energy E, 
which is the sum of confinement and electrostatic energies. 

This sketch is easiest in the bookend regimes. In the first 
regime, where r « r 0 , the atom is too small, and quan- 
tum mechanics is stronger: Its kinetic-energy contribution 
is the dominant term in the energy, because its 1/r 2 scal- 
ing overwhelms the 1/r scaling of the potential energy. In 
the third regime, where r » r 0 , the atom is too large, and electrostatics is 
the dominant term in the energy: Its 1/r scaling goes to zero more slowly 
than the 1/r 2 scaling of the kinetic energy. Therefore, in the extremes (the 
bookends), the total energy has two different scalings: 

E cx 1 1/r 2 (regime l:r«r 0 ) 

{ —1/r (regime 3: r » r 0 ) 



^confinement 



^electrostatic 




To complete the sketch, we interpolate between the bookend regimes. 
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The slopes in the two regimes have opposite signs, so there 
is no smooth way to connect the two extreme segments 
without introducing a point where the slope is zero. This 
point, where the energy is a minimum, is nature's choice 
for the ground state of hydrogen. 

With this sketch, we can explain thermal expansion. Ther- 
mal expansion first appeared in Problem 5.38, where you 
used dimensional analysis to estimate a typical thermal-ex- 
pansion coefficient. However, the problem glided over the 
fundamental question: the coefficient's sign. Knowing nothing about ther- 
mal expansion, we might guess that positive and negative coefficients are 
equally likely. Equivalently, substances are equally likely to expand as to 
contract when heated: Some substances might expand, and others might 
contract. However, this symmetry reasoning fails empirically: Almost all 
substances expand rather than contract when heated. The symmetry be- 
tween positive and negative thermal-expansion coefficients, or between ex- 
pansion and contraction, must get broken somewhere. 

The curve of energy versus size for hydrogen shows where the symmetry 
gets broken. In its generic shape, the curve applies to all bonds, whether 
intra-atomic (such as the electron-proton bond in hydrogen), interatomic 
(such as hydrogen-oxygen bonds in a water molecule), or even intermolec- 
ular (such as the hydrogen bonds between water molecules). Bonds result 
from a competition between (1) attraction, which is important at long dis- 
tances and which looks like the electrostatic piece of the generic curve, and 
(2) repulsion, which is important at short distances and which looks like 
the confinement piece of the generic curve. (Even the gravitational bond 
between the Sun and a planet, represented by the curve that you drew in 
Problem 5.55c, has the same shape.) 

Therefore, the curve of bond energy versus separation cannot be symmetric. 
At the r « r 0 end, which is the first regime, r has a minimum possible value, 
namely zero. However, at the r » r 0 end, which is the third regime, r is 
unbounded. Thus, the two bookend regimes are not symmetric, and the 
bond-energy curve skews toward larger r. 

To see how this asymmetry leads to thermal expansion, look at how thermal 
energy affects the average bond length. First, think of the bond as a spring 
(a model that we will discuss further in Section 9.1). As the bond vibrates 
around its equilibrium length r 0 , thermal energy, which is a kinetic energy. 
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^confinement 




electrostatic 
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converts into and out of potential energy in the bond. The minimum bond 
length r min and the maximum bond length r max are determined by where 
the vibration speed is zero — where the bond has slurped up all the kinetic 
energy (the thermal energy) and turned it into potential energy. 




cold warm hot 



Graphically, we draw a horizontal line at a height E t hermai above the mini- 
mum energy. This line intersects the potential-energy curve twice, at r = 
r min and at r = r max . Because the potential-energy curve skews to the right, 
toward r » r 0 , the average bond length r avg = (r min + r max )/2 is larger 
than r 0 . As E thermal grows, the skew affects the average more, and the differ- 
ence between r avg and r 0 grows: Adding thermal energy increases the bond 
length. Therefore, substances expand when heated. 



8. 3. 2. 3 Core temperature of the Sun 

As our final example, we'll estimate the temperature of the core of the Sun. 
The Sun, like the atmosphere, is a result of competition between thermal 
motion and gravity: Thermal motion expands the Sun; gravity compresses 
the Sun. A dimensionless measure of the two effects' relative strength is 
thermal energy 

P = t • (8-44) 

| gravitational potential energy 



The numerator, the thermal energy for one particle, is just k B T. For the 
denominator, we need to know the mass of a particle. The Sun itself is made 
of hydrogen, but in the core of the Sun, the thermal energy is more than 
enough to strip an electron from the proton (as you verify in Problem 8.19). 
Each proton gets gravitational potential energy from the rest of the Sun, 
which has mass M Sun . The rest of the Sun, if lumped into a massive particle, 
would be at a distance comparable to R Sun , where R Sun is the Sun's radius. 
Therefore, the typical gravitational potential energy (per particle) is 



E 



gravitational 



GM Sun m p 

^Sun 



(8.45) 
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where the ~ contains the minus sign in the potential energy. The ratio of 
energies is therefore 

^ ~ GM Sun m p /R Sun ' (8 ' 46) 

and it produces the following three regimes. 



Regime 1 Regime 2 




Regime 3 




too hot, expanding 
jS » 1 



k B T » 



GM Sun tfi p 

R Sun 



In the first regime, the Sun is too cold for its size, so gravity wins the com- 
petition and compresses the sun. This contraction shows the difference be- 
tween the thermal-gravitational competition in the Sun and in the Earth's 
atmosphere (Section 8. 3. 2.1). The temperature of the atmosphere is deter- 
mined by the blackbody temperature of the Earth, roughly 300 K (Prob- 
lem 5.43). In our analysis of the height of the atmosphere, we therefore 
held the temperature fixed and let only the height vary until it produced 
the gravitational energy to match the fixed thermal energy. In the Sun, how- 
ever, the temperature is determined by the speed of the fusion reactions, 
which depends on the temperature and the density: Higher density means 
more frequent collisions that might result in fusion, and higher tempera- 
ture (faster thermal motion) means that each collision has a higher chance 
of resulting in fusion. Thus, as the Sun contracts, the density increases, as 
does the temperature and the reaction rate. The contraction stops when the 
temperature is high enough for thermal motion to balance gravity. 

(There is a caveat. If a star contracts very quickly, its core can heat up faster 
than the negative-feedback process can oppose the change. Then the core 
ignites like a giant hydrogen bomb and the star becomes a supernova.) 

In the third regime, the Sun is too hot for its size, so thermal motion wins 
the competition and expands the Sun. As the Sun expands, the reaction rate. 
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the temperature, and the thermal energy fall — until thermal motion again 
balances gravity. The result is the middle regime. In the middle regime, 
the Sun has just the right temperature — determined by the condition k B T ~ 
GM Sun m p /R Sun , so 



GM Sun m p 

^B^Sun 



(8.47) 



The proton mass m p in the numerator and Boltzmann's constant k B in the de- 
nominator are easier to handle if we multiply each constant by Avogadro's 
number N A . The product N A k B is the universal gas constant R. The product 
m p N A is the molar mass of protons, which is approximately the molar mass 
of hydrogen: 1 gram per mole. Then 

£ M Sun N A m p 

6.7 x 10 -11 kg -1 m 3 s -2 x 2xl0 30 kg x 10” 3 kgmol _1 

T T t 2— • (8.48) 

8Jmol _1 K _1 x 0.7 x 10 9 m 

N A k B Ksun 



To evaluate T, start with the most important piece. 

1 . Units. The inverse moles in the numerator and denominator cancel. The 
numerator then contributes kg m 3 s~ 2 , which is J m. The denominator 
contributes JK -1 m. Therefore, the Jm in the numerator and denom- 
inator cancel, and the inverse kelvins in the denominator become the 
kelvins in the Sun's core temperature. 

2. Powers often. The numerator contributes 16 powers of ten; the denomi- 
nator contributes 9. The result is 7 powers of ten. Thus, the temperature 
will be comparable to 10 7 K. 

3. Numerical prefactor. The 6.7 in the numerator is mostly canceled out by 
the 8 x 0.7 in the denominator, leaving only the 2 in the numerator. Thus, 
the temperature is roughly 2 x 10 7 K. 

No one has measured the internal temperature directly, but the current best 
estimate for the core temperature is 1.5 x 10 7 K. Our easy-cases model of 
competition between gravity and thermal motion is surprisingly accurate. 

Problem 8.19 Thermal versus electronic energy 

Compare the thermal energy (per particle) in the core of the Sun to the binding 
energy of hydrogen. Were we justified in assuming that the electrons get stripped 
from the protons? 
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8.4 Two dimensionless quantities 

In the examples of Section 8.3, one dimensionless quantity categorized the 
easy-cases regimes. Whether the quantity was small, comparable to 1, or 
large, the regimes could all be arranged on one axis. To extend that idea, 
we'll use easy cases to categorize the world using two dimensionless quanti- 
ties and therefore two axes. For practice, we'll organize the world of waves 
onto a two-dimensional map (Section 8.4.1). Then we'll organize the four 
fundamental branches of physics (Section 8.4.2). 



8.4.1 The two-dimensional world of water waves 



Water waves come in several varieties. Perhaps the most evident regime 
represents waves near a beach. These waves, the subject of Problem 5.15, 
have a wavelength much larger than the depth of the water. You can make 
them at home: Fill a bathtub or baking dish with water, disturb the water, 
and create waves sloshing back and forth. Their speed v is given by v 2 = gh, 
where h is the depth of the water. 



Before these shallow-water waves reached the beach, they traveled 
on the open ocean, where their wavelength was much smaller than 
the depth. Their speed — technically, the phase velocity — is given by 
v 2 = gA (Problem 5.11), where A is the reduced wavelength A /2n. 

These two regimes — deep and shallow water — are distinguished by 
the dimensionless ratio h/A. (You can also use h/A, but the math- 
ematical descriptions turn out to be simpler using h/A.) Therefore, 
the two regimes sit on a dimensionless axis that measures depth. 
Because the axis measures depth, let's orient the axis vertically and 
place deep water at the bottom. 




Another familiar kind of wave is produced by dropping a pebble in a pond. 
Small ripples zoom outward from the point of impact. These waves have 
a small wavelength, much smaller than the depth of the pond. Therefore, 
h/A is large — as it is for deep-water waves. 



However, these ripples are different from the deep-water waves on the open 
ocean. Ocean waves are driven by the water's weight — that is, by gravity. In 
contrast, ripples are driven by the water's surface tension — the same effect 
that allows small bugs to walk on water (see Problem 8.15). In order to 
distinguish ripples from deep-water gravity waves, our second axis should 
measure the relative importance of gravity and surface tension. 
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This axis is therefore labeled by the ratio of gravity to surface tension. This 
ratio is a dimensionless group, so we can find it using dimensional analysis. 
The ratio will depend on two characteristics of the water: its density p (to 
make the weight) and its surface tension 7. It will depend on gravity g 
(to make the weight). And it will depend on the (reduced) wavelength A. 
These four quantities, built from three dimensions, form one independent 
dimensionless group. A useful choice is pgA 2 /j. 

At long wavelengths (large A), in dense fluids (large p), or in strong gravity 
(large g), this dimensionless ratio is large, indicating that gravity drives 
the waves. For short wavelengths (ripples) or, equivalently, high surface 
tension, this ratio is small, indicating that surface tension drives the waves. 
Here are the three regimes categorized using both groups. 




The empty corner stares at us, asking to be filled. It incorporates both limits: 



h 

A 



0 



(shallow water) 



and 




(ripples). 



( 8 . 49 ) 




These waves are therefore ripples on shallow water. (They are hard to make, 
because ripples are tiny, smaller than a few millimeters, yet the water depth 
must be even smaller.) 
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The four corners are the bookend regimes on two axes. If each axis had 
three regimes, two axes could have made three squared or nine regimes. 
The missing five middle regimes can be constructed by interpolating be- 
tween the corners. The easy-cases map shows how: First fill in the four re- 
gions between the corners, and then fill in the center. That analysis, which 
involves addition and a tanh function, gives the following map. 




On this map, the middle regimes are not our usual middle regimes. Our 
usual middle regimes represented a particular regime (which was usually 
of the form fi ~ 1). However, on this map, the middle regimes represent 
the general solution ji = anything. As an example, look at the bottom, 
deep-water row of three regimes. The bookend regimes, deep-water grav- 
ity waves and deep-water ripples, are easy cases of the middle, deep-water 
regime. For fun, check the other limiting cases, including that the central 
regime — which covers waves driven by any mixture of gravity and surface 
tension and traveling on any depth of water — turns into the other eight 
regimes in the appropriate limits. 

8.4.2 The two-dimensional world of physics 

Now we'll use the same method to organize the four fundamental branches 
of physics: classical (Newtonian) mechanics, quantum mechanics, special 
relativity, and quantum electrodynamics. 
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As a first step, which will determine the first axis, let's compare classical 
mechanics and special relativity. Special relativity is Einstein's theory of 
motion. It unifies classical mechanics and classical electrodynamics (the 
theory of radiation), giving a special role to the speed of light c. This role is 
popularized in the T-shirt showing Einstein in a policeman's hat. He holds 
out his arm and palm to make a "Stop" signal and warns: "186 000 miles 
per second. It's not just a good idea. It's the law!" The speed of light is the 
universe's speed limit, and special relativity obeys it. In contrast, classical 
mechanics knows no speed limit. Classical mechanics and special relativity 
therefore sit on an axis connected by the speed of light. In the limit c — > oo, 
special relativity turns into classical mechanics. 



special 


C — > OO 


classical 


relativity P 




mechanics 



For the second axis, we compare one of the two remaining branches of 
physics — either quantum mechanics or quantum electrodynamics — with 
either classical mechanics or special relativity. Because quantum electro- 
dynamics, if only from its name, looks frightening, let's select quantum 
mechanics. We've seen its effect several times: Quantum mechanics con- 
tributes a new constant of nature h. This constant appears in the Heisenberg 
uncertainty principle ApAx ~ h, where A p and Ax are a particle's momen- 
tum and position uncertainties, respectively. The Heisenberg uncertainty 
principle restricts how small we can make these uncertainties, and there- 
fore how accurately we can determine the position and momentum. 

However, if h were zero, then the uncertainty principle would not restrict 
anything. We could exactly determine the position and momentum of a 
particle simultaneously, as we expect in classical mechanics. Classical me- 
chanics is the h — ► 0 limit of quantum mechanics. Therefore, classical and 
quantum mechanics are connected on a second, h axis. The map including 
quantum mechanics therefore has two dimensions. 
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In this two-dimensional map of physics, one corner sits empty. Further- 
more, one branch of physics — quantum electrodynamics — hasn't been con- 
sidered. We need only a bit of courage to set quantum electrodynamics in 
the empty corner. Quantum mechanics must be the c — > oo limit of quan- 
tum electrodynamics. And it is. Quantum electrodynamics is the result of 
marrying special relativity (c < oo) and quantum mechanics ( h > 0). Thus, 
in the h —> 0 limit, quantum electrodynamics turns into special relativity. 




What happened to the bookend regimes? 

The bookend regimes are here implicitly, because this example introduced 
a new feature: These axes are not labeled using dimensionless quantities! 
Only for a dimensionless quantity can we sensibly distinguish the three 
regimes « 1, ~ 1, and » 1. Because c and h have dimensions, the only 
valid comparisons are with zero or with infinity (which are zero and infin- 
ity in any system of units). Thus, there are only two regimes on each axis. 
The special-relativity-classical-mechanics axis compares c with infinity; the 
quantum- mechanics-classical-mechanics axis compares h with zero. 



8.5 Summary and further problems 

When the going gets tough, the tough lower their standards. In this chapter, 
you learned how to do that by studying the easy cases of a problem. This 
tool is based on the idea that a correct solution works in all cases, includ- 
ing the easy cases. Therefore, look at the easy cases first. Often, we can 
completely solve a problem simply by understanding the easy cases. 

Problem 8.20 Easy cases for the period of a pendulum 

Does the period of a pendulum increase, decrease, or remain constant as the am- 
plitude is increased? Decide by selecting an amplitude for which you can easily 
predict the period. 
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Problem 8.21 Pyramid volume 

Use easy cases to find the dimensionless prefactor in the 
volume of a pyramid with height h and a square ( b x b) 
base: 

V = dimensionless prefactor x b 2 h. (8.50) 

In particular, choose the easy case of a pyramid that, with 
a few more copies of itself, can be assembled into a cube. 




Problem 8.22 Power means 



The arithmetic and geometric means are easy cases of a higher-level abstraction: 
the power mean. The kth power mean of two positive numbers a and b is defined 
by 



M k (a,b) 




(8.51) 



You raise the numbers to the fcth power, take the (regular) mean, and then undo 
the exponentiation by taking the kth root. 

a. What is k for an arithmetic mean? 

b. What is k for the rms (root mean square)? 

c. The harmonic mean of a and b is sometimes written as 2 (a || b), where || denotes 
the parallel combination of a and b (introduced in Section 2.4.3). What is k for 
the harmonic mean? 

d. (Surprising!) What is k for the geometric mean? 



Problem 8.23 Easy case of the compound pendulum 

For the compound pendulum of Problem 5.25, what easy case produces an ordi- 
nary, noncompound pendulum? Check that, in this limit, your formula for the 
period from Problem 5.25 behaves correctly. 



Problem 8.24 Means in an elliptical orbit 

A planetary orbit (an ellipse) has two important 
radii: r min and r max . The other lengths in the el- 
lipse are power means of these radii (see Prob- 
lem 8.22 about power means). 

Match the three power means — arithmetic, geo- 
metric, and harmonic — to the three lengths: the 
semimajor axis a (which is related to the orbital 
period), the semiminor axis b, and the semilatus 
rectum / (which is related to the orbital angular 
momentum). 




Hint: The power-mean theorem says that M,„ ( a,b ) < M n ( a,b ) if and only if m < n . 
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Problem 8.25 Four regimes in orbital motion 

Once in a while, there are four interesting easy-cases regimes. An example is orbits. 
The dimensionless parameter /3 that characterizes the type of an orbit is 

kinetic energy 

f = t : 7/ (8.52) 

| gravitational potential energy | 



where the absolute value handles a possibly negative potential energy. 

A related dimensionless parameter is the orbit eccentricity e. In terms of the ec- 
centricity, a planet's orbit in polar coordinates is 



r{9) = 



l 

1 + e cos 9 ' 



(8.53) 



where the Sun is at the origin, and 1 is the length scale of the orbit (l is diagrammed 
in Problem 8.24). Sketch and classify the four orbit shapes according to their values 
of /3 and e (giving a point value or a range, as appropriate): (a) circle, (b) ellipse, 
(c) parabola, and (d) hyperbola. 



Problem 8.26 Superfluid helium 

Helium, when cold, turns into a liquid. When very cold, the liquid turns into a 
superfluid — a quantum liquid. Here is a dimensionless ratio determining how 
quantum the liquid is 

_ quantum uncertainty in the position of a helium atom 
P ~ separation between atoms 

a. Estimate /3 in terms of the quantum constant h, helium's density p (as a liquid), 
the thermal energy fcgT, and the atomic mass m He . 

b. In the /3 — 1 regime, helium becomes a quantum liquid (a superfluid). Thus, 
estimate the superfluid transition temperature. 



Problem 8.27 Adiabatic atmosphere 

The simplest model of the atmosphere is isothermal: The atmosphere has one tem- 
perature throughout it. A better approximation, the adiabatic atmosphere, relaxes 
this assumption and incorporates the adiabatic gas law: 

OC 1, (8.55) 

where p is atmospheric pressure, V is the volume of a parcel of air, and y = c p /c v 
is the ratio of the two specific heats in the gas. (For dry air, y = 1.4.) Imagine an 
air parcel rising up a mountain. As the parcel rises into air with a lower pressure, 
it expands, and its volume and temperature change according to a combination of 
the adiabatic and ideal gas laws. 

a. What easy case of y reproduces the isothermal atmosphere? 

b. For y = 1.4 (dry air), will air temperature decrease with, increase with, or be 
independent of height? 
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Problem 8.28 Loan payments 

A fixed- term, fixed- interest-rate loan has four important parameters: the principal 
P (the amount borrowed), the interest rate r , the repayment interval r, and the 
number of payments n . The loan is repaid in n equal payments over the loan term 
)7T. Each payment consists of a principal and an interest portion. The interest 
portion is the interest accumulated on the principal outstanding during the term; 
the principal portion reduces the outstanding principal. 

The dimensionless quantity determining the type of loan is /3 = nrr. 

a. Estimate the payment (the amount per term) in the easy case /3 = 0, in terms 
of P, n, and r. (The term nr and the repayment interval r don't vary that 
much — t is usually 1 month and ut is somewhere between 3 to 30 years — so 
/3 « 1 is usually reached by lowering the interest rate r .) 

b. Estimate the payment in the slightly harder case where /3 « 1 (which includes 
the /3 = 0 case). In this regime, the loan is called an installment loan. 

c. Estimate the payment in the easy case /3 » 1. In this regime, the loan is called 
an annuity. (This regime is usually reached by increasing n.) 

Problem 8.29 Heavy nuclei 

In this problem, you study the innermost electron in an atom such as uranium that 
has many protons, and analyze a surprising physical consequence of its binding 
energy. Imagine a nucleus with Z protons around which orbits one electron. Let 
E(Z) be the binding energy (the hydrogen binding energy is the case Z = 1). 

a. Show that the ratio E(Z) /E(l) is Z 2 . 

b. In Problem 5.36, you showed that E(l) is the kinetic energy of an electron mov- 
ing with speed txc where a is the fine-structure constant (roughly 10 -2 ). How 
fast does the innermost electron move around a heavy nucleus with charge Z? 

c. When that speed is comparable to the speed of light, the electron has a kinetic 
energy comparable to its (relativistic) rest energy. One consequence of such a 
high kinetic energy is that the electron has enough kinetic energy to produce a 
positron (an anti-electron) out of nowhere; this process is called pair creation. 
That positron leaves the nucleus, turning a proton into a neutron as it exits: 
The atomic number Z decreases by one. The nucleus is unstable! Relativity 
therefore places an upper limit to Z. Estimate this maximum Z and compare it 
with the Z for the heaviest stable nucleus (uranium). 

Problem 8.30 Minimum wave speed 

For deep-water waves, estimate the minimum wave speed in terms of p, g, and y 
(the surface tension). Test your prediction in two different ways. (1) Drop a peb- 
ble into water, and observe how fast the slowest ripples move outward. (2) Move 
a toothpick through a pan of water, and look for the fastest speed at which the 
toothpick generates no waves. 
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Problem 8.31 Surface tension and the size of raindrops 

A liquid's surface tension, usually denoted 7 , is the energy required to create new 
surface divided by the area of the new surface. For a falling raindrop, surface ten- 
sion and drag compete: the drag force flattens the raindrop, and surface tension 
keeps it spherical. If the drop gets too flat, it can lower its surface energy by break- 
ing into smaller and more spherical droplets. The fluid dynamics is complicated, 
but we don't need to know it. Instead, use competition reasoning (easy cases) to 
estimate the maximum size of raindrops. 

Problem 8.32 Waves driven by surface tension 

Imagine a wave with reduced wavelength A on the surface of a fluid, 

a. Show that the dimensionless ratio 

potential energy due to gravity 



R = 



(8.56) 



potential energy due to surface tension 

is, after ignoring dimensionless constants, the dimensionless group pgft 2 /y 
that we used in Section 8.4.1 to distinguish waves driven by gravity from waves 
driven by surface tension. 

b. For water, estimate the critical wavelength A at which R ~ 1. 

Problem 8.33 Including buoyancy 

The terminal speed v of a raindrop with radius r can be written in the following 
dimensionless form: 

71 2 



_ r ( P water ) 
r g ~ 1 V Pair ) ' 



(8.57) 



In this problem, you use easy cases of x = p wa ter//°air to guess how buoyancy affects 
this result. (Imagine that you may vary the density of air or water as needed.) 
In dimensional analysis, including the buoyant force requires including p aiT , g, 
and r in order to compute the weight of the displaced fluid (which is the buoy- 
ant force) — but those variables are already included in the dimensional analysis. 
Therefore, including buoyancy doesn't require a new dimensionless group. So it 
must change the form of the dimensionless function f. 

a. Before you account for buoyancy: What is the dimensionless function f(x)7 
Assume spherical raindrops and that c d ~ 0.5. 

b. What would be the effect of buoyancy if p water were equal to p air ? This thought 
experiment is the easy case x = 1. Therefore, find /( 1). 

c. Guess the general form of / with buoyancy, and thereby find v including the 
effect of buoyancy. 

d. Explain physically the difference between v without and with buoyancy. Hint: 
How does buoyancy affect g? 
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Our final tool for mastering complexity is making spring models. The essen- 
tial characteristics of an ideal spring, the transferable abstractions, are that 
it produces a restoring force proportional to the displacement from equilib- 
rium and stores an energy proportional to the displacement squared. These 
seemingly specific requirements are met far more widely than we might 
expect. Spring models thereby connect chemical bonds (Section 9.1), xylo- 
phone notes (Section 9.2.3), gravitational radiation (Section 9.3.3), and the 
colors of the sky and sunsets (Section 9.4). 



9.1 Bond springs 

A ubiquitous spring is the bond between the electron and proton in hydro- 
gen — the bond that is our model for all chemical bonds. In Section 9.1.1, 
we'll build a spring model of hydrogen, giving us a physical model for the 
Young's modulus (Section 9.1.2) and for the speed of sound (Section 9.1.3). 

9.1.1 Finding the spring 

In Section 8. 3. 2. 2, we saw how hydrogen is a competition between electro- 
statics and quantum mechanics. When the electron-proton separation x is 
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much smaller than the Bohr radius a 0 , quantum mechanics wins, and the 
net force on the electron is outward (positive). When the separation is much 
larger than the Bohr radius, electrostatics wins, and the net force is inward 
(negative). Between these two extremes, when the separation is the Bohr 
radius (when x = n 0 ), the force crosses the zero-force line (the x axis). 

This equilibrium point is an easy case that helps us de- 
scribe the bond force simply. We magnify the force curve 
at the equilibrium point. The curve now looks straight, 
because any curve looks straight at a large-enough mag- 
nification. Equivalently, any curve can be approximated 
locally by its tangent line — which is an example of lump- 
ing shapes and graphs (Section 6.4) and is where spring 
models discard actual information and complexity. 

Physically, the straight-line approximation means that, 
as long as the bond distance differs from n 0 by only a small amount Ax, the 
force is linearly proportional to the deviation Ax. Furthermore, the force 
curve has a negative slope: A negative deviation produces a positive force, 
and vice versa. The force therefore opposes the deviation and is a restor- 
ing force. A linear restoring force is the force from an ideal spring, so the 
electron-proton bond is an ideal spring! It has an equilibrium length n 0 at 
which F = 0 and spring constant k, where — k is the slope of the force curve. 

F = —A: Ax. (9.1) 

To make this small Ax stretch, the required energy AE is 

A E ~ force x Ax. (9.2) 

Because the force varies from 0 to k Ax, the typical or characteristic force is 
comparable to A Ax. Then 

AE ~ kAx ■ Ax = k( Ax) 2 . (9.3) 

The scaling exponent connecting AE and Ax is 2. This quadratic depen- 
dence on the displacement is the energy signature of a spring. For small dis- 
placements around the equilibrium point (the minimum), the energy-ver- 
sus-displacement curve is a parabola. (This analysis is a physical version of 
a Taylor-series approximation.) 

Because almost any energy curve has a minimum, almost every system con- 
tains a spring. For the bond spring, the energy relation AE ~ A: (Ax) 2 gives 
us an estimate of the spring constant k. Pretend that the energy curve is ex- 
actly a parabola, even for large displacements, and increase Ax to the bond 



F 
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size a (for hydrogen, a is the Bohr radius a 0 ). That stretch requires an energy 
AT ~ kn 2 . It is a characteristic energy of the bond, so it must be comparable 
to the bond energy E 0 . Then E 0 ~ kn 2 , and 

k ~ % (9.4) 

a 1 

This relation is an estimate for the spring constant in terms of quantities 
that we know in other ways: E 0 from the heat of vaporization and a from 
the density and atomic mass. The estimate is often off by a factor of 3 or 10, 
because of the inaccuracy in extending the parabolic, ideal-spring approxi- 
mation to displacements comparable to the bond length. But it gives us an 
order of magnitude that will be useful in subsequent estimates. 



9.1.2 Young’s modulus 



From the spring constant of one bond, we could 
find the spring constant of a block of material. But ^ 
as we discussed in Section 5.5.4, a better measure 
is the Young's modulus Y: It is an intensive quan- 
tity, so not dependent on the block's dimensions. The Young's modulus is 
measured by stretching a block of material with a force F at each end. 
stress 



Y = 



(9.5) 




strain 

The stress is F /A, where A is the block's cross-sec- 
tional area. Estimating the strain requires more 
steps. Fortunately, the estimate breaks into a tree. 

To grow it, imagine the block as a bundle of fibers, 
each a chain of springs (bonds) and masses (atoms). Because strain is the 
fractional length change, the strain in the block is the strain in each fiber and 
the strain in each spring of each fiber: 
spring extension 



strain = , , 

bond length a 

That's the root of the tree. Here are the internal nodes: 

force/ fiber 



spring extension 
force F 



spring constant k ' 



fiber N fibers ' 

The number of fibers will be 



(9.6) 



(9.7) 



(9.8) 
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N _ cross-sectional area A 
fibers p er -fiber cross-sectional area a 2 

Here is how the leaf values propagate to the root. 

1 . The force per fiber becomes Fa 2 /A. 

2. The per-spring extension becomes Fa 2 /kA. 

3. The strain becomes Fa/kA. 



Finally, the Young's modulus becomes k /a: 

y _ stress _ F/A _ k 
~ strain Fa/kA a' 



( 9 . 10 ) 



Because k ~ E 0 /a 2 (Section 9.1.1), Y ~ E 0 /a 3 , which 
confirms with a spring model our dimensional-analysis 
prediction in Section 5.5.4. 



9.1.3 Sound speed in solids and liquids 



strain 

Fa/kA 




spring bond 

extension length 
Fa 2 /kA a 




t° rce spring 

fiber constant 
Fa 2 /A fc 




force Nfibers 

F A/ a 2 



The spring model of solids and even liquids also gives a physical model for 
the speed of sound. Start with a fiber of atoms and bonds: 



©WWWWV\©WWWWV\© • • • 

^bond H 

The sound speed is the speed at which a vibration signal travels along the 
fiber. Let's study the simplest lumped signal: In one instant, the first mass 
moves to the right by a distance x, the signal amplitude. 



© 0WV\AAA/W\@WWV\MAA@ ■ • ■ 

The compressed bond pushes the second atom to the right. When the sec- 
ond atom has moved to the right by the signal amplitude x, the signal has 
traveled one bond length. The sound speed is the propagation distance 
fl bon d divided by this one-bond propagation time. 



© ©WWW VV)A0V\AAA/WW@ ■ ■ ■ 

What is the propagation time? That is, roughly how much time does the second 
mass require to move to the right by the signal amplitude x? 
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The second mass moves because of the spring force from the first spring. 
At t = 0, the spring force is kx. As the mass moves, the spring loses com- 
pression, and the force on the second mass falls. So an exact calculation of 
the propagation time requires solving a differential equation. But lumping 
will turn the calculation into algebra: Just replace the changing spring force 
with its typical or characteristic value, which is comparable to kx. 



The force produces a typical acceleration n ~ kx/m. After a time t, the 
mass will have a velocity comparable to at and therefore will have moved 
a distance comparable to at 2 . This force acts for long enough to move the 
mass by a distance x, so at 2 ~ x. Using a ~ kx/m gives kxt 2 /m ~ x. The 
amplitude x cancels, as it always does in ideal-spring motion. (Thus, the 
sound speed does not depend on loudness.) The propagation time is then 



(9.11) 



This characteristic time is just the reciprocal of the natural frequency cv 0 — 
«]k/m . In this time, the signal travels a distance fl bond , so 

distance traveled fl bo nd ka Lnd 

Cc ~ ~ : — a / . \y.iZ) 

propagation time Jm/k V m 



To make this expression more meaningful, let's convert the numerator and 
denominator, which are in terms of microscopic (atomic) quantities, into 
macroscopic quantities. To do so, divide by fl bond /« bond in the square root: 



^ fl bond/ fl bond _ I ^/ fl bond 
m /«bond V "'/"bond ’ 



(9.13) 



The numerator k /a honA is, as we saw in Section 9.1.2, the Young's modulus Y. 
The denominator is the mass per molecular volume, so it is the substance's 
density p. Thus, c s ~ ^]Y/p . Our physical, spring model therefore confirms 
our estimate for c s in Section 5.5.4 based on dimensions analysis. 



9.2 Energy reasoning 

The analysis of sound propagation in Section 9.1.3 required estimating the 
spring forces. Often, however, the forces or their effects are harder to track 
than are the energies. Then, as you'll see in the next examples, we track 
the energy and look for the energy signature of a spring: the quadratic 
dependence of energy on displacement. 
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9.2.1 Oscillation frequency of a spring-mass system 



To illustrate the energy method, let's practice on the most fa- 
miliar spring system by finding its natural (angular) oscilla- 
tion frequency 0 Jq. The method requires finding its kinetic 
and potential energies. Because these energies vary in complicated ways, 
we use typical or characteristic energies. 



In terms of the amplitude A 0 , the typical potential energy is comparable to 
L4 q. The typical kinetic energy is comparable to mv 2 , where v is the typical 
speed of the oscillating mass. This speed is comparable to A 0 co 0 , because 
the mass travels a distance comparable to A 0 in the characteristic time 1 /ay 
(which corresponds to 1 radian, or approximately one-sixth of an oscillation 
period). Thus, the typical kinetic energy is comparable to mA^co^. 



In spring motion, kinetic and potential energy interconvert, so the ratio 
typical potential energy 



typical kinetic energy 



(9.14) 




should be comparable to 1. This bold conclusion is not limited to spring 
motion. For example, for gravitational orbits, the ratio, defined carefully 
using the time-averaged energies, is —2. More generally, the virial theorem 
says that, with a potential V r n , the energy ratio will be 2/n. 

Equating the typical energies gives an equation for co 0 : 

kAy ~ mAy (O q. (9.15) 

F F ~ 

^potential ^kinetic 



The amplitude A Q divides out — another illustration that a spring's period 
is independent of amplitude — giving a> () ~ sjk/m . Because the energy ratio 
is 1 (due to the virial theorem), the missing dimensionless prefactor is 1. 



9.2.2 Vibrations of a piano string 

From springs to strings: A piano string is a steel wire stretched close to its 
breaking point — the high tension makes the string's resistance to bending 
less important and the sound cleaner (as you investigated in Problem 9.17). 
When you push a piano key, a hammer bangs on the string and sets it into 
vibration — whose frequency we'll estimate with a spring model. 

For a physical model, start with an unstretched piano string of length L. 
It is a bundle of springs and masses, so it acts like one large spring. Now 
stretch the string by putting it under tension T. Then hammer it. 
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The hammer gives the atoms vertical velocities. Eventu- 
ally the kinetic energy turns into potential energy, and 
the string gets a sinusoidal shape with a wavelength 
A = 2L and a small amplitude y 0 . 

As we did for the simple spring-mass system (Section 9.2.1), we'll find the 
typical potential energy in the string and the typical kinetic energy in the 
motion of the string. The potential energy comes from the tension force: 
The curved string is longer than the equilibrium string, so the tension force 
has done work on the string by stretching it; the string stores that work as 
its potential energy. The work done is the force times the distance, so 

^potential ~ T x extra length. (9.16) 

To estimate the extra length, lump a piece of the curved 
string as the hypotenuse of a right triangle with base A, 
where A = A/2 tt. This base represents 1 radian of the 
sine-wave shape. In 1 radian, a sine wave attains almost 
its full height (sin 1 « 0.84), so the height of the triangle 
is comparable to the amplitude y 0 . The triangle then has slope tan 8 « i/ (l /A . 
Because y 0 « A, the opening angle 8 is small; thus, tan 8 « 8 and 8 « y 0 / A. 



string 




A 




Using our lumping triangle, we'll find the fractional change 
in length between the hypotenuse and the base. Fractional 
changes, being dimensionless, require less algebra and are 
more widely applicable than absolute changes. To find the 
fractional change, rescale the triangle so that the base has unit length; then 
it has height 8 and hypotenuse -V 1 + 6 2 . Because 8 is small, the square root 
is, by the binomial theorem, approximately 1 + 8 2 /2. Thus, the fractional 
increase in length is comparable to 8 2 , which is y^ /A 2 . 




The fractional increase applies to the whole string, whose length was L, so 

. 2 



extra length ~ L 



(Vo 

u 



(9.17) 



The work done in making this much stretch, which is the potential energy 
in the string, is T x extra length, where T is the tension, so 

, 2 



-'potential 



TL ( 



(9.18) 



As befits a spring, even a giant one composed of individual bond springs, 
the potential energy is proportional to the square of the amplitude y 0 . 
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Now let's estimate the kinetic energy in the motion of the string. As the 
string vibrates at the so-far-unknown angular frequency to, the pieces of 
the string move up and down with a typical speed toy 0 . Thus, 

E k inetic ~ mass x (typical speed) 2 ~ pb 2 L x to 2 y 2 , (9.19) 

m ~v 2 



where p is the string's density and b is its diameter. The kinetic energy is 
also proportional to the squared amplitude y ^ — the other energy signature 
of a spring. Equating the energies gives the equation for to: 

'¥)• 



pb 2 Jrrco 2 y?f ~ TH 



(9.20) 



^ potential 



The length L and the squared amplitude y 2 } cancel, leaving 



1 

W = A 



T 

pb 2 



(9.21) 



Despite the extensive use of lumping, this result turns out to be exact — as 
do many energy-based spring analyses. The circular frequency / = to jin 
has the same structure: 

f = xjw 1 ' 19221 

The wave propagation speed is /A (or toK), which is just - jT/pb 2 . Let's 
check that this speed makes sense. As a first step, it can be rewritten as 




Because the numerator T /b 2 is the pressure (force per area) applied to the 
ends of the string to put it under the tension T, the speed of these transverse 
waves is ^ applied pressure/p . (They are called transverse waves because 
the direction of vibration is perpendicular, or transverse, to the direction 
of travel.) This speed is analogous to the speed of sound that we found in 
Section 5.5.4, J pressure /p, where the pressure was the gas pressure or the 
elastic modulus. So our speed makes good sense. 



How long is the middle-C string on a piano? 



We can find the length from the propagation speed and the frequency. The 
frequency of middle C is roughly 250 hertz. The propagation speed is 
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v = 




(9.24) 



where cr is the stress T/b 2 . This stress is also eY, where e is the strain (the 
fractional change in length). Using u — eY and c s = ^jY/p , 




Thus, in terms of the Mach number M, which measures speeds relative to 
the full wave speed, transverse waves have a Mach number of ~le . 



In steel, c s « 5 kilometers per second. For piano wire, which is made of 
high-strength carbon steel, the yield strain is roughly 0.01. However, the 
string is not stretched quite so far. To provide a margin of safety, the strain 
e is kept to roughly 3 x 10 . Then the transverse-wave speed is 



0.06 x 5 x 10 3 ms 1 = 300 m s 1 . 

fe c s 



(9.26) 



At / « 250 hertz, the wavelength is roughly 1.2 meters: 

300 ms -1 
/ " 250 Hz 



A=? 



= 1.2 m. 



(9.27) 




The wavelength of this lowest, fundamental frequency is twice the string's 
length, so the string should be 0.6 meters long. To check, I looked into our 
piano: 0.6 meters is almost exactly the length of the strings set into motion 
when I play middle C. 



9.2.3 Musical notes from bending beams 

Another musical device that we can model is a wooden or metal slat in a 
marimba or xylophone. Using spring models, proportional reasoning, and 
dimensional analysis, we'll find how the frequency of the 
note depends on its dimensions. 

Thus, imagine a thin block of wood of length /, width w, 
and thickness h. It is supported at the two dots (or held at 
one of them) and tapped in the center. As it vibrates, its 
shape varies from bent to straight and back to bent. Here 
are the shapes shown in side view. 




slat's musical 

tap 




l 
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How does the slat's width w affect the frequency? 

The answer comes from that cheapest kind of experi- 
ment, a thought experiment. Using a type of argument 
developed by Galileo in his study of free fall, lay two 
identical slats of width w side by side. Tapping both 
slats simultaneously produces the same motion as does tapping a slat of 
width 2 w (the result of gluing the two slats along the long, thin edge). So 
the width cannot affect the frequency. 

How does the slat's thickness h affect the frequency? 

The slat, made of atoms connected by bond springs, acts 
like a giant spring-mass system. When the slat is straight 
(the equilibrium position), it has zero potential energy. The 
energy increases upon bending the slat, which stretches or 
compresses the bond springs. Thus, the slat resists bending. As befits a 
giant spring, its resistance to bending can be represented by a stiffness or 
spring constant k. (Mechanical engineers define a related quantity called 
the bending stiffness or the flexural rigidity, which has dimensions of en- 
ergy times length. Our stiffness is an actual spring constant, with dimen- 
sions of force per length.) 

Then, as befits a giant spring-mass system, the slat has a vibration fre- 
quency comparable to fkjrn , where m is its mass. Deciding how the thick- 
ness affects the frequency has split into two smaller problems: how the 
thickness affects the mass and how the thickness affects the stiffness. The 
first decision is not difficult: The mass is proportional to the thickness. 

How does the stiffness k depend on thickness? 





To answer this proportional-reasoning question, we'll perform the thought 
experiment of bending each slat by the same vertical deflection y. 




The stiffness k is proportional to the force F required to bend the beam 
(F = —ky). However, we won't try to understand k by finding how the 
thickness affects F itself. Force is a vector, so finding the required force re- 
quires carefully bookkeeping many little forces and their directions to know 
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which contributions cancel. Instead, we'll find how the thickness affects the 
stored energy (the potential energy). As a positive scalar quantity, potential 
energy has no direction or even sign and is therefore easier to bookkeep. 

Because a slat is a big spring, the energy required to produce the vertical 
deflection y is given by 

E ~ ky 2 . (9.28) 

Because y is the same for the slats (an easy condition to enforce in a thought 
experiment), the energy relation becomes the proportionality E oc k. To 
find how k depends on thickness, let's redraw the bent slats with a dotted 
line showing the neutral line (the line without compression or extension): 




Above the neutral line, the bond springs along the length of the block get 
extended; below the neutral line, they get compressed. The compression or 
extension A l determines the energy stored in each bond spring. Then the 
stored energy in the whole slat is 

^ ~ ^typical spring X ^springs* (9.29) 

Because E oc k, 



k oc ^typical spring X ^springs* (9.30) 

To find how E typical spr i n g depends on the slat's thickness h, break the energy 
into factors (divide and conquer): 

^typical spring ~ ^bond X (^0 typical spring - 

Because the bonds in the two blocks are the same (the blocks differ only in 
thickness), this relation becomes the proportionality 

^typical spring ^ (^0 typical spring' 

Therefore, the stiffness is 

k oc (A l ) 2 ... x N mrinav (9.33) 

To find how (A/) typicalspring depends on h, compare typical bond springs in 
the thick and thin blocks — for example, a spring halfway from the neutral 
line to the top surface. Because the thick block is twice as thick as the thin 
block, this spring is twice as far from the neutral line in absolute distance. 
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The extension is proportional to the distance from the neutral line — as you 
can guess by observing that it is the simplest scaling relationship that pre- 
dicts zero extension at the neutral line (or try Problem 9.1). In symbols, 

( A 0 typical spring “ h - (9-34) 

Furthermore, N springs is also proportional to h. Therefore, k oc h 3 : 

k oc (A l)} . , .x AL nriri .,„ oc /j 3 . (9.35) 

v ' typical spring springs v ’ 



Doubling the thickness multiplies the stiffness by eight! The vibration fre- 
quency of a slat, considered as a giant spring, is 




The mass is proportional to h, sow® h: 




(9.37) 



Doubling the thickness should double the frequency. To test this prediction, 
I tapped two pine slats having these dimensions: 

' 1 cm ( h for the thin slat); 

. 2 cm ( h for the thick slat). 



30 cm x 5 cm x 



(9.38) 



/ 



To measure the frequencies, I matched each slat's note to a note on a pi- 
ano. The thin slat sounded like C one octave above middle C. The thick slat 
sounded like A in the octave above the thin block. The interval between the 
two notes is almost an octave or a factor of 2 in frequency. 

Now let's extend the analysis to the xylophone slats, which vary not in thick- 
ness but in length. This extension bring us to the third scaling question. 

How does the slot's length l affect the frequency? 



We already found the scaling relation between to (frequency) and w (width), 
namely co oc w°; and between co and h (thickness), namely co oc h. By 
adding the constraints of dimensional analysis to these scaling relations, 
we can find the scaling relation between co and l . 

The quantities relevant to the frequency co are the speed of sound c s and 
two of the three dimensions: thickness h and length /. The third dimension, 
the width, is not on the list, because the frequency, we already found, is 
independent of the width. (Instead of the speed of sound, the list could 
include the Young's modulus Y and the density p. As the only two variables 
containing mass, Y and c s would end up combining anyway to make c s .) 
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These four quantities, made from two dimensions, make 
two independent dimensionless groups. The first group 
should be proportional to the goal co. Because co and c s 
are the only quantities containing time, each as T -1 , the 
group has to contain co/c s . Making to/c s dimensionless 
requires multiplying by a length. Either of the two lengths works. Let's 
choose /. (For the alternative, try Problem 9.2). Then the group is col/c s . 



CO 


j-i 


frequency 


C s 


LT -1 


sound speed 


h 


L 


thickness 


l 


L 


length 



The other group should not contain the goal co. Then the only choices are 
powers of the aspect ratio h/l. If we use h/l itself, the most general dimen- 
sionless statement is 




The thickness scaling relation, co °ch, determines the form of /, giving 



col h 

c s * ' 



(9.40) 



Solving for the frequency. 



co ~ 



c s h 
l 2 ‘ 



(9.41) 



As a scaling relation, co oc l~ 2 . Let's check the scaling expo- 
nent against experimental data. When my older daughter was 


C 


l (cm) 


/(Hz) 


small, she got a toy xylophone from her uncle. Its (metal) slats 


12.2 


261.6 


have the tabulated dimensions and frequencies. The lower and 


D 


11.5 


293.6 


higher C notes (C and C') are a factor of 2 apart in frequency. If 


E 


10.9 


329.6 


the scaling relation is correct, C should come from the longer 


F 


10.6 


349.2 


slat, and the ratio of slat lengths should be *12. Indeed, the 


G 


10.0 


392.0 


measured length ratio is almost exactly 42 « 1.414: 


A 


9.4 


440.0 


* 1.419. (9.42) 

8.6 cm 


B 


8.9 


493.8 


C' 


8.6 


523.2 



Problem 9.1 Spring extension versus distance from the neutral line 

Consider a bent slat as an arc of a circle, and thereby explain why the spring exten- 
sion is proportional to the distance from the neutral line. 

Problem 9.2 Alternative dimensionless group 

Repeat the dimensional analysis for the dependence of the oscillation frequency on 
slat length but using wh/c s and h/l as the two independent dimensionless groups. 
Do you still conclude that co oc 1~ 2 ? 
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Solving the beam differential equation gives the vibration frequency as 



/ 



3.56 h 

m XCs i r 



(9.43) 



The dimensionless prefactor 3.56 /VT2” is almost exactly 1. This example is 
one of the rare cases where using circular frequency (/) rather than angular 
frequency (oj) makes the prefactor closer to 1. 

For my block of pine, a light wood, p « 0.5p water and Y ~ 10 10 pascals, so 



c s = 




IQ 10 Pa 
0.5 x 10 3 kgm 



-3 



4.5 km s 



-l 



(9.44) 



For the thin block, h = 1 centimeter and l = 30 centimeters, so 
h _ _i 10“ 2 ' 



/ 



n 2 



4.5 x 10 3 ms 



to - 1 



■ m 



rm 



450 Hz. 



(9.45) 



This estimate is reasonably accurate. The thin block's note was approxi- 
mately one octave above middle C, with a frequency of roughly 520 hertz. 



Problem 9.3 Graphing the data on frequency versus length 

Check the scaling to oc l~ 2 by plotting the xylophone data for frequency versus 
length on log-log axes. What slope should the graph have? 

Problem 9.4 Finding the stiffness, then the frequency 

Use dimensional analysis to write the most general dimensionless statement con- 
necting stiffness k to a slat's Young's modulus Y, width w, length l, and thickness 
h. What is the scaling exponent q in k oc ah? Use that scaling relation and k oc h 3 
to find the missing exponents in k ~ Y^w^Vh 3 . 

Problem 9.5 Xylophone notes 

If you double the width, thickness, and length of a xylophone slat, what happens 
to the frequency of the note that it makes? 




Problem 9.6 Location of the node 

Here's how you can predict the location of the node 
(where to hold the wood block) using conservation. 

Because the bar vibrates freely without an external 
force, the center of mass (the dot) stays fixed. Ap- 
proximate the bar's shape as a shallow parabola, find the center of mass (CM), and 
therefore find the node locations (as a fraction of the bar's length). My daughter's 
longest xylophone slat is 12.2 centimeters long with holes 2.7 centimeters from the 
ends. Is that fraction consistent with your prediction? 
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9.3 Generating sound, light, and gravitational radiation 

The sound generated by the vibrating wood blocks is an example of the 
most pervasive spring: radiation. It comes in three varieties. Electromag- 
netic radiation (or, more informally, light) is produced by an accelerating 
charge. Sound (acoustic radiation) can be produced simply by a changing 
but nonmoving charge (such as an expanding or contracting speaker mem- 
brane). Therefore, sound is simpler than light — which in turn is simpler 
than gravitational radiation. Do the easy cases first: We'll first apply spring 
models to sound (Section 9.3.1). By adding the complexity of motion, we'll 
extend the analysis to light (Section 9.3.2). Then we'll be ready for the com- 
plexity of gravitational radiation (Section 9.3.3). 

9.3.1 Acoustic radiation from a charge monopole 

When we think of radiation, we think first of electromagnetic radiation, 
which we see (pun intended) everywhere. To analyze sound radiation while 
benefiting from what we know about electromagnetic radiation, we'll find 
an analogy between electromagnetic and acoustic radiation — starting at the 
source of radiation, namely a single charge (a monopole). 

The search for the acoustic analog of charge is aided by scaling relations. An 
electric charge q produces a disturbance, the electric field E. Their connec- 
tion is E oc q. Because the symbols E and q amplify the mental connection to 
electromagnetism, let's write the relation between E and q in words. Words 
promote a broader, more abstract view not limited to electromagnetism: 

field oc charge. (9.46) 

Another transferable scaling comes from the energy density £ (energy per 
volume) in the field. For an electric field, £ <x E 2 . In words, 

energy density oc field . (9.47) 

Sound waves move fluid, and motion means kinetic energy. Because the 
kinetic-energy density is proportional to the fluid velocity squared, the 
acoustic field could be the fluid velocity v itself. Then our first scaling rela- 
tion, that field ex charge, becomes 

v oc charge. (9.48) 

Thus, an acoustic charge moves fluid and with a speed proportional to the 
charge. In contrast to electromagnetism, acoustic charge cannot measure a 
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fixed amount of stuff. For example, it cannot measure simply the volume of 
a speaker. A fixed volume would produce no motion and no acoustic field. 



Instead, acoustic charge must measure a change in the source. As an ex- 
ample of such a change, imagine an expanding speaker. As it expands, it 
pushes fluid outward. The faster it expands, the faster the fluid moves. To 
identify the kind of change to measure, let's work backward from the elec- 
tric field £ of a point electric charge to the velocity field v of a point acoustic 
charge. The electric field points outward with magnitude 



E = 



9 

47re 0 r 2 



(9.49) 



Furthermore, the electrostatic e () appears in the energy density 

G = ie 0 E 2 , (9.50) 

whose acoustic counterpart is the kinetic-energy density 




Because E and v are analogous, the electrostatic c 0 corresponds, in acoustics, 
to the fluid density p. Therefore, in the electric field, let's replace e 0 by p, 
E by v, and q by acoustic charge to get 



acoustic charge 
4npr 2 



(9.52) 



or 

acoustic charge = pv x \nr 2 . 



(9.53) 



Here, r is the distance from the charge, and v is the fluid's speed outward 
(just as the electric field points outward). Then each factor in the acoustic 
charge has a meaning, as does the product. The factor pv is the mass flux: 

flux = density of stuff x speed. (9.54) 

pv ~p V 

The factor 47rr 2 is the surface area of a sphere of radius r . Thus, the acoustic 
charge pv x 47ir 2 measures the rate at which mass flows out of this sphere. 

The acoustic charge itself, at the center of this sphere, must displace mass at 
this rate. So the acoustic analog of charge is a mass source. The source could 
be an expanding speaker that directly forces fluid outward. Alternatively, it 
could be a hose supplying new fluid that forces the old fluid outward. For 
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the rate, a convenient notation is M : The dot represents the time derivative, 
turning mass into a mass rate — which is the charge strength. 



field 

source strength (charge) 
field from a point source 



acoustics electrostatics 

fluid velocity v electric field E 

M q 

Ml q 1 

4np r 2 47re 0 r 2 



We have found an acoustic field v proportional to r~ 2 . But, as we found 
in Section 5.4.3, the signature of radiation is that the field is proportional 
to r -1 . So, we have constructed the acoustic analog of a static electric field 
and charge, but we have not yet constructed a radiating acoustic system. 

Producing radiation requires change — for example, due to a speaker. As a 
model of a speaker, a small pulsating sphere grows and shrinks in response 
to the music that it broadcasts. Maybe you put the sphere in a fancy box and 
slap a brand name on it, but growing and shrinking is still its fundamental 
operating principle and how it makes sound. A simple model of this change 
is spring motion — a sinusoidal oscillation in the charge: 

M = M 0 cos cot. (9.55) 

At t = 0, when cos cot = 1, the speaker is expanding at its maximum rate, 
displacing mass at a rate M 0 . At t = n / co, the speaker is contracting at its 
maximum rate. Here is one cycle of its oscillation. 




u’t = 0 u’t = — u’t = n u’t = — u’t = 2n 
2 2 

How much power does this changing acoustic charge radiate? 

This analysis requires easy cases and lumping. Easy cases will help us find 
the velocity field; because the charge and field are changing, lumping will 
help us find the resulting energy flow. The easiest case is near the charge: 
News about the changes requires no time to arrive, so the fluid responds to 
a changing M instantaneously. In this region, we know v: 

M 

Anpr 2 



v = 



(9.56) 
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Far from the charge, however, the fluid cannot know right away that M has 
changed. But what does "far" mean? As we learned in Chapter 8, easy 
cases are defined by a dimensionless parameter, so the distance from the 
source is not enough information by itself to decide between near and far. 

The decision needs a comparison length. This length is based on how the 
news is transmitted: It travels as a sound wave, so it has speed c s . Because 
the change happens at a rate co (the angular frequency of the charge oscil- 
lation), the characteristic timescale of the changes is r = 1/co. In this time, 
the charge changes significantly, and the news has traveled a distance c s r 
or ~ c s / co . This distance is the reduced wavelength A of the sound wave 
produced by the speaker (A = A/2n). 

Therefore, "near the charge" (in the near field or zone) ^ " 
means r « A. "Far from the charge" (in the far field or ' s ^ 

the radiation field or zone) means r » A. For example, ' y near \ far 

for middle C (/ « 250 hertz, and A w 1.3 meters), near 1 charge • zone 1 zone 

and far are measured relative to 20 centimeters. N , 

\ / 

With A as the comparison length, the dimensionless ratio x „ ___ ^ ' 

determining whether r is small or large is r/A, which is 
rco/c s . At the boundary between the near and far zones, r ~ A. 

In the lumping model, the velocity field in the near zone follows the changes 
in M instantly, with energy flowing outward and inward in rhythm with the 
speaker's motion. At the zone boundary, at r ~ A, the velocity field changes 
its character. It becomes a signal describing those changes, and this signal, 
a sound wave, travels outward at the speed of sound c s . 



To estimate the power carried by this signal — which is the power radiated 
by the source — start with the power per area, which is energy flux. At the 
zone boundary, r ~ A, so 

energy flux = energy density (at r ~ A) x propagation speed. (9.57) 

pv 2 /2 c s 



To estimate the energy density at r ~ A, return to the lumping approx- 
imation — that the velocity field tracks the changes in M throughout the 
near zone — and gather enough courage to extend the assumption. Assume 
that the instantaneous tracking happens all the way out to the zone bound- 
ary — that is, it applies not just when r « A but even when r ~ A (where the 
field abruptly changes its character and becomes a radiation field). 

In this approximation, the velocity field at r ~ A is 
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M 

V 

AnpA 2 

so the energy density £ ~ pi; 2 / 2 becomes 
M 



P 1 



47rpA" 



(9.58) 



(9.59) 



The radiated power P is the energy flux times the surface area of the sphere 
enclosing the near zone: 

2 



47iA 2 



1 ( M 

x Att? 



c c . 



(9.60) 



surface area energy density pv 2 /2 propagation speed 

Using A = cjoo, the power radiated by our acoustic monopole becomes 

jj _ 1 M 2 co 2 

P mono P ole g^ ^ ■ 

Despite the absurd number of lumping approximations, this result is exact! 
Let's use it to estimate the acoustic power output of a tiny speaker. 

How much power is radiated by an R = 1 centimeter speaker whose radius varies 
by +1 millimeter atf = 1 kilohertz (roughly two octaves above middle C)? 



This calculation becomes slightly simpler if we replace M by pV, 
where V is the rate of volume change. (In acoustics, V is often called 
the source strength Q — for example, in the classic work The Physics 
of Musical Instruments [15, p. 172]. However, for the sake of the anal- 
ogy with electromagnetism, it is more consistent to make the source 
strength M rather than V .) In terms of V, 

1 P V 2 co 2 

monopole g^ ^ ■ 

Here, V — V 0 cos tot, where V () is the amplitude of the oscillations in V. 
Therefore, the power P monopo i t , is also oscillating. By symmetry, the aver- 
age value of cos 2 cot is 1 /2 (Problem 3.38), so the time-averaged power is 
one-half the maximum power: 

_ i pl/ 2 a ; 2 
av s 16tt c s ' 

To find the amplitude V 0 , write V in terms of the speaker dimensions: 



±lmm 




(9.62) 



(9.63) 
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V = 4nRf x u surface , (9.64) 

surface area 

where t> SLirfacc is the outward speed of the speaker membrane. Because the 
surface is oscillating like a mass on a spring with amplitude A 0 , its maxi- 
mum velocity is A 0 co and its time variation is 

*Wace(0 = A 0 W COS Vt. (9.65) 



Then 

V = 4nR 2 A 0 co cos cot. (9.66) 

The corresponding amplitude is everything except the cos cot : 

y 0 = AttR 2 A 0 CO. (9.67) 



The radiated power is then 



^o 2 



P = 

avg 



p (4 tzR 2 A 0 co) 2 co 2 npR^A 2 ^ 



1671 



(9.68) 



By multiplying P avg by c 3 /c 3 , the power can be written in terms of the di- 
mensionless ratio Rco/c s , which is also R/A: 




Physically, R/A is the dimensionless speaker size (measured relative to A). 
The scaling exponent of 4 tells us that the radiated power depends strongly 
on the dimensionless size of the speaker. As a result, big speakers (large R) 
are loud; and long wavelengths (low frequencies) require big speakers. 

For this speaker, the radius is R is 1 centimeter, the surface-oscillation am- 
plitude A 0 is 1 millimeter, and / is 1 kilohertz. So the wavelength of the 
sound is roughly 30 centimeters (A = cjf), and A is roughly 5 centimeters. 
Then the dimensionless speaker size R/A is approximately 0.2, and 

P avg ~ 3 x lkgm -3 x (3 x 10 2 ms -1 )' 2 x (l0 _3 m) 2 x 0.2 4 . (9.70) 

'IT ~p cl ~Al (KMP 

To evaluate this power mentally, divide and conquer as usual: 

1. Units. They are watts: 

kgm -3 x m 3 s -3 x m 2 = kgm 2 s -3 = W. 

2. Powers often. They contribute 10 -4 : 



(9.71) 
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IQ 6 X 10~ 6 X 1Q~ 4 = 10- 4 . (9.72) 

fromh* from Tig from (R/A) 4 

So far, the power is 10 -4 watts. 

3. Remaining numerical factors. They are 

3 x 3 3 x 2 4 . (9.73) 

from 7X from from (R/A) 4 

2 

Because 2 4 = 16 and 3 x 3 3 is (3 2 ) or roughly 10 2 , the remaining numer- 
ical factors contribute roughly 1600. Let's round it to 2000. 

Then the power is roughly 2000 x 10 -4 or 0.2 watts. 

Does that power represent a loud or a soft sound? 

It depends on how close you stand to the speaker. If you are 1 meter away, 
the 0.2 watts are spread over a sphere of area 4zr x (1 meter) 2 , or roughly 
10 square meters. Then the power flux is roughly 0.02 watts per square 
meter. In decibels, which is the more familiar measure of loudness (intro- 
duced in Problem 3.10), this power flux corresponds to just over 100 deci- 
bels, which is very loud, almost enough to cause pain. 

What radius fluctuations would produce a barely audible, 0-decibel flux? 

Shrinking the flux to 0 decibels, which is a drop of 100 decibels, is a drop 
of a factor of 10 10 in energy flux and power. Because the energy flux is 
proportional to A 2 , A 0 must fall by a factor of 10 5 : from 10 -3 meters to 
10 -8 meters. Thus, 10-nanometer fluctuations in a small speaker's radius 
are (barely) enough to produce an audible sound. The human ear has an 
amazing dynamic range and sensitivity. 



9.3.2 Electromagnetic radiation from a dipole 



In the spirit of laziness, let's transfer, by analogy, the radi- 
ated power from acoustic to electromagnetic radiation. In 
Section 9.3.1, we developed the analogy between acoustics 
and electrostatics. Using it, the acoustic radiated power, 

n 1 M 2 cv 2 



monopole 



(9.74) 



87T pc s 

implies an electromagnetic radiated power of q 2 io 2 /8nc 0 c. 



acoustics 


electrostatics 


P 


€ 0 
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E 
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Alas, this conjecture has three problems. First, if it represents the power ra- 
diated by an oscillating charge — moving, say, on a spring oscillating with 
frequency a ; — then its acceleration is oc a' 2 , so the radiated power is pro- 
portional to the acceleration. However, we learned from dimensional analy- 
sis (Section 5.4.3) that the power had to be proportional to the acceleration 
squared. Second, the power should depend on the amplitude of the motion, 
which is a length, yet the proposed power contains no such length. 

These two problems are symptoms of the third problem, that transferring 
the acoustic analysis to electromagnetism has made an illegal situation. A 
single changing electromagnetic charge Cj(t) violates charge conservation: 
If q(t ) is increasing, from where would the new charge come? 

This question suggests the valid physical model, that the new charge 
comes from a nearby charge. As a model of this flow, imagine a pair 
of opposite, nearby charges. As the positive charge flows to the negative 
charge, the two charges swap places; and vice versa. This model is an 
oscillating dipole. Here is a full cycle of its oscillation. 

© 0 © 

© © © 

u’t = 0 wt = n utt = 2n 



© 

flow 

© 



If the charges are ±q and their separation is Z, then ql is called their dipole 
moment d. Here, the time-varying dipole moment is d(t) = d {) cos cot, 
where d 0 is the amplitude of the oscillations in the dipole moment. To 
estimate the power radiated, we'll reuse the structure from acoustics. 



P 



477T 2 x -e 0 E 2 x c, 

surface area energy density propagation speed 



(9.75) 




A 



1 




i 



and evaluate it at the near-field-far- field boundary (r ~ A). The only change 
from acoustics is that the electric field E is not the field from a single point 
charge (a monopole source) but rather from two opposite charges (a dipole 
source). Let's evaluate their field at the position of a test charge at r ~ A. 

The two charges (the monopoles) contribute slightly different electric fields 
E + and E_. Because these fields are vectors, adding them correctly requires 
tracking their individual components. Let's therefore make the lumping 
approximation that we can add the vectors using only their magnitudes: 

Edipole ~E + -E_. 



(9.76) 
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This approximation would be exact if the vectors lay along the 
same line — which they would if the dipole were an ideal di- 
pole, with zero separation (l = 0). By making this approxima- 
tion even for this nonideal dipole, we will obtain an important 
and transferable insight about the field of a dipole. 

Because the distances from the test charge to the monopoles 
are almost identical, the two fields E + and E_ have almost the 
same magnitude. Therefore, the difference E + — E_ is almost 
zero. (The key word is almost. If the difference were exactly 
zero, there would be no light and no radiation.) Let's approxi- 
mate the difference using a further lumping approximation. 




The dipole field is the difference AT = E(r + ) — E(r_), where E(r) is 
the monopole field. The difference is approximately 

A £ « E'(r) x A r , (9.77) 

rise slope run 

where A r = r_ — r + . (This formula ignores a minus sign, but we 
are interested only in the magnitude of the field, so the sign doesn't 
matter.) In Leibniz's notation, the slope E'(r) is also dE/dr. Using 
the lumping approximation of Section 6.3.4 (which I remember as 
d ~ d), the d s cancel and dE/dr, and therefore E'(r), is roughly E/r. 




AE 



The trickiest factor is A r, which is r_ — r + . It depends on the charge separa- 
tion / and on the position of the test charge relative to the dipole's orienta- 
tion. When the test charge is directly above the dipole (at the north pole), 
A r is just the charge separation /. When the test charge is at the equator, 
A r = 0. With our lumping approximation, A r is comparable to /. 



Then the difference AE, which is the dipole field, becomes 
F ~ F y L 

^dipole ^ monopole A ‘ * 



(9.78) 



The 1/r factor comes from differentiating the monopole field. The factor of Z, 
the dipole size, turns the derivative of the field back into a field and makes 
the overall operation dimensionless: Making a dipole from two monopoles 
dimensionlessly differentiates the monopole field. 



Because the energy density 8 in the electric field is proportional to E 2 , and 
the power radiated is proportional to the energy density, the power is 

. 2 



dipole ~ monopole 



(9.79) 
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In evaluating the radiated pozver, what r should we use? 



The power radiated is determined by the energy density at the boundary 
between the near- and far-field regions: at r ~ c/co. With that substitution, 

■ 2 - q 2 l 2 co* 



1 



2 2 



dipole 



87t e 0 c 



l 

c/co 



87T 



CnC° 



(9.80) 



1 monopole 



With a 1 /6tt instead of 1 /8n, this result is exact. Furthermore, if the source 
is a single accelerating charge, instead of a charge flow, then its acceleration 
is comparable to lev 2 , and P dipole ~ c/^/CqC 3 , which is now consistent with 
what we derived in Section 5.4.3 using dimensional analysis. 

In terms of the dipole moment d = ql, 

n 1 co*d 2 

Pdi P° le ~ (9 ' 81) 

Dipole radiation is the strongest kind of electromagnetic radiation. In Sec- 
tion 9.4, well use the dipole power to explain why the sky is blue and a 
sunset red. 



Problem 9.7 Lifetime of hydrogen if it could radiate 

Assuming that the ground state of hydrogen could radiate as an oscillating dipole 
(because of the orbiting electron), estimate the time r required for it to radiate its 
binding energy E 0 . The ground state of hydrogen is protected by quantum mechan- 
ics — there is no lower-energy state to go to — but many of hydrogen's higher-energy 
states have a lifetime comparable to r. 



9.3.3 Gravitational radiation from a quadrupole 

Having started with acoustics and practiced with electromagnetics, we can 
extend our analysis of radiation to gravitational waves — without solving 
the equations of general relativity. In acoustics, radiation could be pro- 
duced by a monopole (a point charge). In electromagnetics, radiation could 
be produced by a dipole but not by a monopole. Building a dipole requires 
charges of two signs. Because the gravitational equivalent of charge is mass, 
which comes in one sign, there is no way to make a gravitational dipole. 

Therefore, gravitational radiation requires a quadrupole. A quadrupole is 
to a dipole what a dipole is to a monopole. It is two nearby dipoles with 
opposite strengths — so that their fields almost cancel. 
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An example is an oblate sphere. Relative to a sphere, it is 
fat at the equator (represented by the + signs) and thin at 
the poles (represented by the — signs). One plus-minus 
pair forms one dipole. The other plus-minus pair forms 
the second dipole. They have the same magnitude but 
point in opposite directions. As the sphere shifts to pro- 
lateness (tall and thin), the signs of the charges flip, as do 
the directions of the dipoles. 




Just as the dipole field is the dimensionless derivative of the monopole field 
(Section 9.3.2), the quadrupole field is the dimensionless derivative of the 
dipole field. Thus, if the pulsating object has size Z, so that the two dipoles 
are separated by a distance comparable to ?, then at the boundary between 
the near and far fields (at r ~ c/co), the fields are related by 



-'quadrupole 



-'dipole 



c/co 



= £ 



col 



dipole 



(9.82) 



The radiated powers are related by the square of the extra factor: 

2 



quadrupole 



P X 

1 dipole 



col 

c 



(9.83) 



That dimensionless ratio in parenthesis has a physical interpretation. Its 
numerator col is the characteristic speed of the objects making the field. Its 
denominator c is the wave speed. Their ratio is the Mach number M, so 
col/c is the characteristic Mach number of the sources. In terms of M, 

Equadrupole ~ ^dipole X ^ • (9.84) 

For the radiated power from an electromagnetic dipole, we found an anal- 
ogous relation: 

^dipole ~ ^monopole x M 2 . (9.85) 

In general, 

£ 2 "'-pole ~ ^monopole x M 2m , (9.86) 

where a 2°-pole is a monopole, a 2 1 -pole is a dipole, and so on. 

With the analogy between electrostatics and gravity (from Section 2.4.2), we 
can convert the power radiated by an electromagnetic dipole to the power 
radiated by a gravitational dipole, if it existed. Then we just adjust for the 
difference between a dipole and a quadrupole. From the analogy, the elec- 
trostatic field E is analogous to the gravitational field g, and electrostatic 
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charge q is analogous to mass m. Finally, because g = Gm/r 2 is analogous 
to E = q/Ane 0 r 2 , the electrostatic combination 1 /4 ttc 0 is analogous to New- 
ton's constant G. 



For electromagnetic dipole radiation, the power radiated is 
1 q 2 l 2 cv 4 



dipole 



67re n 



(9.87) 



Replacing 1 /47re 0 with G and q with m, but leaving c alone because gravita- 
tional waves also travel at the speed of light (a limitation set by relativity), 
the power radiated by a gravitational dipole, if it existed, would be 



p Gm 2 l 2 <o 4 

1 dipole ^3 



(9.88) 



Changing from dipole to quadrupole radiation adds a factor of cvl/c to the 
field and ( cvl/c ) 2 to the power, so 



P 



quadrupole 



Gm 2 ! 4 w 6 



(9.89) 



Let's use this formula to estimate the gravitational power radiated by the 
Earth-Sun system. We'll divide the system into two sources. One source is 
the Earth as it rotates around the system's center of mass (CM). The other 
source is the Sun as it rotates around the system's center of mass. 




system Earth as source Sun as source 



Which source generates more gravitational radiation? 



The two sources share the constants of nature G and c. Because they orbit 
around the center of mass like a spinning dumbbell, they also share the 
angular velocity a). Therefore, the power simplifies to a proportionality 
without G, c, or co : 

-^quadrupole “ ™ 2 ! 4 , (9-91) 

where m is the mass of the object (either the Earth or Sun) and l is its dis- 
tance from the center of mass. Furthermore, ml is shared, because the center 
of mass is defined as the point that makes ml the same for both objects. 

^ Carth * ^Earth-CM distance ^5un X ^Sun-CM distance - (9.92) 
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By factoring out two powers of ml and discarding them, the proportionality 
further simplifies from P quadmpole °c m 2 / 4 to P quadrupo i e °c l 2 - The Earth, 
with the longer lever arm Z, generates most of the gravitational wave energy. 
Equivalently, we can factor out four powers of ml and get Pquadmpole “ ni~ 2 ; 
the Earth, with the smaller mass, still wins. So, 



quadrupole 



Gm Lth l4u > 6 



(9.93) 



As the last simplification before evaluating the power, let's eliminate the 
angular frequency to. For motion in a circle of radius Z, the centripetal ac- 
celeration is v 2 /l (as we found in Section 5.1.1). In terms of the angular 
velocity, this acceleration is co 2 l. It is produced by the gravitational force 



r GMsu n m Eart h 

? 2 



(9.94) 



(approximately, because Z is slightly smaller than the Earth-Sun distance). 
The resulting centripetal acceleration is F/m Eaith or GM Sun /Z 2 , so 



co 2 l = 



GM Sun 

Z 2 



(9.95) 



Using this relation to replace (to 2 l) 3 in P qua d rU poie with (GM Sun /? 2 ) 3 gives 



P 



quadrupole 






l 5 C 5 



(9.96) 



The power based on a long and difficult general-relativity calculation is al- 
most the same: 



quadrupole 



32 G 4 , 

5 l 5 c 5 mEarth ' 



'ATsun) 2 ( m Earth + M. 



Sun 



)• 



(9.97) 



With the approximation that m Earth + M Sun ~ M Sun , the only difference be- 
tween our estimate and the exact power is the dimensionless prefactor of 
32/5. Including that factor and approximating m Earth + M Sun by M Sun , 



quadrupole 



32 G 4 ^Earth M S 3 un 
5 Z 5 C 5 



(9.98) 



To avoid exponent whiplash and promote formula hygiene, let's rewrite the 
power using a dimensionless ratio, by pairing another velocity with the c in 
the denominator. It would also be helpful to get rid of G, which seems like 
a random, meaningless value. To fulfill both wishes, we again equate the 
two ways of finding the Earth's centripetal acceleration, as the acceleration 
produced by the Sun's gravity and as the circular acceleration v 2 /l: 
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GM Sun _ v^_ 

l 2 l ' 



Therefore, GM Sun /Z = u 2 and 



/ 4 



Sun/ 

U 8 



(9.99) 



(9.100) 



That substitution gives 

p 32 mirth i/ 8 

quadruple ~ 5 fc5 • 



(9.101) 



The ratio v 5 /c 5 is M 5 , where M is the Mach number of the Earth (its orbital 
velocity compared to the speed of light). Of the three remaining powers of 
v, one power combines with / in the denominator to give back the angular 
velocity co = v/l. The remaining two powers of v combine with one power 
of m Eart h to give (except for a factor of 2) the orbital kinetic energy of the 
Earth. The remaining masses make the Earth-Sun mass ratio m Earth /M Sun . 
Therefore, in a more meaningful form, the power is 

^quadrupole « y M 5 X ™Earth^ 2 <*>■ (9-102) 



In this processed form, the dimensions are more obviously correct than they 
were in the unprocessed form. The factors before the x sign are all dimen- 
sionless. The factor of niy anh v 2 is an energy. And the factor of co converts 
energy into energy per time — which is power. 

Now that we have reorganized the formula into meaningful chunks, we are 
ready to evaluate its factors. 

1. The mass ratio is 3 x 10 -6 : 



ffl Earth „ 6xl0 24 kg _ 

M Sun ~ 2x1030 kg 



(9.103) 



2. 



The Mach number M = v/c turns out to have a compact value. The 
Earth's orbital velocity v is 30 kilometers per second (Problem 6.5): 



circumference 
orbital period 



2tt x 1.5 x 10 11 m 
7 t x 10 7 s 



3xl0 4 ms 4 , 



(9.104) 



which uses the estimate from Section 6.2.2 of the number of seconds in 
a year. The corresponding Mach number is 10~ 4 : 

M = » = 3 x 10 4 ms" 1 = in _ 4 
— c 3 x 10 8 ms -1 



(9.105) 




9.4 Effect of radiation: Blue skies and red sunsets 



345 



3. For the factor m Varih v 2 , we know m Earth and have just evaluated v. The 
result is 6 x 10 33 joules: 

6 x 10 24 kg x 10 9 m 2 s -2 = 6 x 10 33 J. (9.106) 

"'Earth V 2 

4. The final factor is the Earth's orbital angular velocity uj. Because the 
orbital period is 1 year, co = 2n/\ year, or 

O n-r 

— = 2 X 10 -7 S _1 . (9.107) 

n x 10 7 s 



With these values, 

37 

Pquadrupole ~ X 3 X 10 -6 X 10 -20 X 6 X 10 33 J X 2xlO _7 S _1 . (9.108) 

"'Earth/Msim M 5 "'Earth 1 ’ 2 W 



The resulting radiated power is about 200 watts. At that rate, the Earth's 
orbit will not soon collapse due to gravitational radiation (Problem 9.8). 

Quadrupole radiation depends strongly on the Mach number P source /c, and 
the Earth's Mach number is tiny. However, when a star gets captured by a 
black hole, the orbital speed can be a large fraction of c. Then the Mach 
number is close to 1 and the radiated power can be enormous — perhaps 
large enough for us to detect on distant Earth. 

Problem 9.8 Energy loss through gravitational radiation 

In terms of a gravitational system's Mach number and mass ratio, how many or- 
bital periods are required for the system to lose a significant fraction of its kinetic 
energy? Estimate this number for the Earth-Sun system. 

Problem 9.9 Gravitational radiation from the Earth-Moon system 

Estimate the gravitational power radiated by the Earth-Moon system. Use the 
results of Problem 9.8 to estimate how many orbital periods are required for the 
system to lose a significant fraction of its kinetic energy. 



9.4 Effect of radiation: Blue skies and red sunsets 

We have developed almost all the pieces and tools to understand our final 
two phenomena: blue skies (Section 9.4.1) and red sunsets (Section 9.4.2). 
The only missing piece is the amplitude of a spring-mass system when it is 
driven by an oscillating force. We'll build that piece where it is first needed, 
and then put all the pieces together. 
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9.4.1 Skies are blue 



The light that we see from a clear sky is dipole radiation 
scattered from air molecules (on the Moon, with no at- 
mosphere, there are no blue skies). Plenty of sunlight 
does not get affected by air molecules, but unless we look 
at the sun, which is almost always very hazardous, the 
direct sunlight does not reach our eye. (The exception is 
just at sunset, and in Section 9.4.2 we'll use the exception 
to help explain why sunsets are red.) 




The analysis makes the most sense as a causal sequence from the sunlight 
to this scattered radiation. Sunlight is an oscillating electric field. The elec- 
tric field exerts a force on the charged particles in an air molecule (say, in 
N 2 ). The charged particles, the electrons and protons, form spring-mass 
systems, and the electric force accelerates the masses. Accelerating charges 
radiate, which is the radiation that reaches our eye. By quantifying the 
steps in the sequence from sunlight to scattered radiation, we'll see why 
the scattered radiation looks blue. 



1. Electric field of the sunbeam. Sunlight contains many colors of light, each 
simple to describe as an electric field (divide and conquer!): 

E(co) = E 0 (co) COSCOt, (9.109) 

where co is the angular frequency corresponding to that color. For ex- 
ample, for red light, co is 3 x 10 15 radians per second. The amplitude 
Eq(co) depends on the intensity of the color in sunlight, so E{co) is a 
distribution over co, and it and the amplitude E 0 (co) have dimensions 
of field per frequency. However, as a simple lumping approximation, 
think of seven separate fields, one for each color in the color mnemonic 
Roy G. Biv: red, orange, yellow, green, blue, indigo, and violet. 

For the color of the sky, what matters is the relative distribution of col- 
ors in the sunbeam and how that relative distribution is different in scat- 
tered light. Therefore, let's do our calculations in terms of an unknown 
E 0 and, in the spirit of proportional reasoning, determine the depen- 
dence of E 0 on co in the scattered light. 

2. Force on the charged particles. The electric field produces a force on the 
electrons and protons in an air molecule. Using e for the electron charge 
and ignoring signs, 

F = eE = eE 0 cos cot, 



(9.110) 
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3. Amplitude of the motion of the charged particles. Because protons are much 
heavier than an electrons, the electrons move faster and farther than the 
protons and produce most of the scattered radiation. Therefore, let's 
assume that the protons are fixed and analyze the motion of an electron. 

The electron, which is connected to its proton by 
a spring, is driven by the oscillating force F. For- 
tunately, we do not need to solve for the motion 
in general, because we can use easy cases based 
on the ratio between the driving frequency co and the system's natural 
frequency co 0 (which is fk/m e , where k is the bond's spring constant). 
The three regimes are then (1) co « co 0 , (2) co = co 0 , and (3) co » co 0 . 

To decide which regime is relevant, let's make a rough estimate of how 
the two frequencies compare. For air molecules, the natural frequency 
co 0 corresponds to ultraviolet radiation — the radiation required to break 
the strong triple bond in N 2 . The driving frequency co corresponds to 
one of the colors in visible light (sunlight), so the electron's motion is in 
the first, low-frequency regime co « co 0 . (For the analysis of the other 
regimes, try Problems 9.12 and 9.15.) 

The low-frequency regime is easiest to study in the co = 0 extreme. It 
represents a constant force F = eE 0 pulling on the electron and stretch- 
ing the electron-proton bond. When there is change, make what does not 
change! The bond stretches until the spring force balances the stretching 
force eE 0 . The forces balance when the stretch is x — F/k or eE 0 /k. 




Because co = 0, the force has been constant since forever, so the bond 
stretched to its extended length back in the mists of time. When co is 
nonzero, but still much smaller than co 0 , the bond still behaves approx- 
imately as it did when co = 0: It stretches so that the spring force bal- 
ances the slowly oscillating force F. Therefore, the stretch, which is the 
displacement of the electron, is 



x(t) 



eE n 



■ cos cot. 



( 9 . 111 ) 



4. Acceleration of the electron. We can find the acceleration using dimen- 
sional analysis. In driven spring motion, the important length is the dis- 
placement x, and the important time is 1/co. Therefore, the acceleration, 
which has dimensions of LT _ , must be comparable to xco 2 . Because we 
used the angular frequency (co rather than /), the dimensionless prefac- 
tor turns out to be 1. Then the electron's acceleration is 
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a(t) = x(t)co 2 



eE 0 co 2 



cos cot. 



(9.112) 



Because the spring constant k is related to the electron mass m e and to 
the natural frequency co 0 by k = m e co q. 



0(0 



eE, 



o or 



m e co, 



cos cot. 



o 



(9.113) 



5. Power radiated by the accelerating electron. We found in Sections 5.4.3 and 
9.3.2 that the power radiated by an accelerating charge e is 



Pdi P° le 67TC 0C 3- 

For the squared acceleration a 2 , we'll use the time average to find the 
average radiated power. Because the time average of cos 2 cot is 1/2, 



<« z > 



2 \ _ 



2 m 2 co 



Then 

P. 



1 e 4 E 2 



dipole 



127re 0 c 3 m 2 



- 

"o/ 



(9.115) 



(9.116) 



This mouthful is useful in explaining the color of sunsets (Section 9.4.2), so 
it has been worthwhile carrying lots of baggage on the trip from sunlight to 
scattered light. But to explain the color of the daytime sky, we can simplify 
the power using proportional reasoning. Because e 0 , c, co 0 , m e , and e are 
independent of the driving frequency co (which represents the color), 

^dipole « E 2 CO 4 . (9.117) 

The factor of Eq is proportional to the energy density in the incoming sun- 
light (at frequency co), which itself is proportional to the incoming power 
in the sunlight (also at the frequency co). Thus, 

^dipole ^ -^sunlight ^ ■ (9.118) 



Let's review how the four powers of co got here. For low frequencies — and 
visible light is a low frequency compared to the natural electronic-vibration 
frequency of an air molecule — the amplitude of spring motion is indepen- 
dent of the driving frequency. The acceleration is then proportional to co 2 . 
And the radiated power is proportional to the square of the acceleration, so 
it is proportional to co 4 . 
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Therefore, the air molecule acts like a filter that takes in (some of the) in- 
coming sunlight and produces scattered light, altering the distribution of 
colors — similar to how a circuit changes the amplitude of each incoming 
frequency. However, unlike the low-pass RC circuit of Section 2.4.4, which 
preserves low frequencies and attenuates high frequencies, the air molecule 
amplifies the high frequencies. 

Here are the sunlight and the scattered spectra based on the a’ 4 filter. Each 
spectrum shows, by the area of each band, the relative intensities of the var- 
ious colors. (The unlabeled color band between red and yellow is orange.) 




Sunlight looks white. In the scattered light, the high-frequency colors such 
as blue and violet are much more prominent than they are in sunlight. For 
example, because c^biue/^red ~ 1-5 and 1.5 4 « 5, the blue part of the sun- 
light is amplified by a factor of 5 compared to the red part. As a result, the 
scattered light — what comes to us from the sky — looks blue! 

9.4.2 Sunsets are red 

As sunlight passes through the atmosphere, ever more of its energy gets 
taken in by air molecules and then scattered (reradiated) in all directions. 
As we found in Section 9.4.1, this process happens more rapidly at higher 
(bluer) frequencies, due to the factor of a; 4 in the radiated power. There- 
fore, the sunbeam becomes redder as it travels through the atmosphere. If 
the beam travels far enough in the atmosphere, sunsets should look red. 
To estimate the necessary travel distance, we can adapt the analysis of Sec- 
tion 9.4.1. Then we will use geometry to estimate the actual travel distance. 

How far must sunlight travel in the atmosphere before the beam looks red? 

To estimate this length, let's estimate the rate at which energy gets scattered 
out of the beam. The beam carries an energy flux 

F = ^e 0 E o c ' 



(9.119) 
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which has the usual form of energy density times propagation speed. (The 
electric and magnetic fields each contribute one-half of this flux.) To mea- 
sure the effect of one scattering electron on the beam, let's compute one 
electron's radiated power divided by the flux: 



dipole 



1 






127re 0 c 3 






Wo 



\ e o^c. 



(9.120) 



L dipole 



Noticing that the (unknown) field amplitude E 0 cancels out and doing the 
rest of the algebra, we get 



P 



dipole 



F 



8tz f e 2 1 \ 2 
3 \Ane 0 m e c 2 ) 



x 



area (T 




(9.121) 



The left side, Pdi po uW / is power divided by power per area. Thus, P ( |i po | e /E 
has dimensions of area. It represents the area of the beam whose energy 
flux gets removed and scattered in all directions. The area is therefore called 
the scattering cross section a (a concept introduced in Section 6.4.5, when 
we estimated the mean free path of air molecules). 



On the right side, the frequency ratio u>/co 0 is dimensionless, so the prefac- 
tor must also be an area. It is called the Thomson cross section <r T : 



CT j — 



8 71 



1 



4716 , 



o m e c z 



7x10 



- 29 m 2 . 



(9.122) 



Because cr T is an area, the factor inside the parentheses must be a length. 
Indeed, it is the classical electron radius r 0 of Problems 5.37 and 5.44(a). It 
is comparable to the proton radius of 10 

,2 l 



-15 



meters: 



r o = 



4716, 



0 w e c 



2.8 x 10 -15 m. 



(9.123) 



It approximately answers the question: For the electron to acquire its mass 
from its electrostatic energy, how large should the electron be? This elec- 
tron's cross-sectional area is, roughly, the Thomson cross section. In terms 
of the Thomson cross section, our scattering cross section a is 

(w V 

U = £7y . (9.124) 

Wo/ 



Each scattering electron converts this much area of the beam into scattered 
radiation. As a rough estimate, each air molecule contributes one scattering 




